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I. 



INTROpUCTIO N AND LITERATURE REVIE W 



This first chapter consists of an introduction, a 
discussion of some of the notation and terminology used in 
subsequent chapters and a review of pertinent literature. 

The first section provides background information and 
motivation for the work represented by this thesis. The 
second section is a preview of the remainder of the thesis. 
The third section, together with Appendix A, provides the 
theoretical background which justifies development of 
balance equations in future chapters. The fourth section 
introduces some of xhe notation and terminology used 
throughout the thesis. Although some of the chapters may be 
read independently of each other, an understanding of the 
material in the fourth section is essential to all other 
chapters with the possible exception of Chapter III. The 
final section is a review of the literature pertinent to the 
subject of this thesis. 

1. INTRODUCIION 



The WWMCCS Computer Performance Evaluation Office of the 
Command and Control Technical Center provides support to the 
Joint. Chiefs of Staff (JCS) in assessing the capaoility of 
the Worldwide Military Command and Control System (WWMCCS) 
community to be responsive to the needs of the JCS. As a 
part of this support the WWMCCS Computer Performance 
Evaluation Office is responsible for the development, 
procurement and utilization of computer performance 
evaluation and measurement methodologies to investigate 



12 



computer system performance at WWMCCS sites. These 
methodologies are also to be used to predict the performance 
of proposed computer systems and the performance of existing 
computer systems under differing workloads and hardware 
configurations. 

One method available for attacking such problems is 
essentially trial and error. A proposed system can be 
assembled, or a proposed change to an existing system can be 
made, and the performance of the resulting system measured 
and evaluated. This is, however, a costly approach. Not 
only is the rental or purchase of the hardware prohibitive 
of such practices, but also the installation cost and time, 
and time spent in developing and implementing the required 
software, or changes in software, must be considered. 

A less costly alternative ro the trial and error method 
is a method in which performance measurements are made on an 
already-existing system which is identical to, or at least 
quite similar to, the proposed system. This is, perhaps, 
the best method. However, it is often not feasible. Either 
a similar system does not exist, or it is not possible to 
collect the needed data from it. Even when it is possible 
tc make measurements, the type of jobs which make up the 
lead offered to the existing system may differ drastically 
from the type cf jobs which will be offered zo the proposed 
system. Such differences in the job streams make 
conclusions drawn from collected data suspect. 

Two other methods involve modelling the proposed 
computer system as a network of queues. Such queuing models 
have been discussed by Gaver [36], ilcKinney [79], Shedler 
[100], Hyszewianski and Disney [113], and many others. Both 
analytic techniques and simulation have been used as 
solution procedures. 
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Analytic methods quite often require rather stringent 
assumptions which actual systems usually do not satisfy. On 
the other hand, they sometimes have a surprising (and, 
unfortunately, unpredictable) robustness which allows the 
differences between the assumptions and real life to be 
ignored. Although steady-state, or long-run average, 
measures cf performance are the most commonly attainable 
quantities, transient behavior is sometimes also derivable. 
However, this transient behavior depends upon starting 
conditions which, for most cases of interest in modelling a 
computer, are unknown. Fortunately, many potential 
problems, most notably bottlenecks, can be detected using 
steady-state measures. (The reader is referred, for 
example, to Euzen [16].) 

Simulaticn models have the advantage of providing an 
appealing pictorial similarity to the modelled system. By 
observing the event-by-event occurrences during a 
simulaticn, cne can "watch" jobs progress from queue to. 
queue. The transient behavior of the system can be examined 
quite explicitly. However, for exactly the reasons given in 
the preceding paragraph, the most useful measures derived 
from a simulation are the steady-state measures. 
Unfortunately, for models of any reasonable degree of 
complexity, accuracy in these steady-state measures requires 
rather large running times. (These times increase at least 
exponentially with complexity.) Besides the running-time 
problem, there is often the problem of determining when the 
effect of the initial conditions has died out, so that 
collection of such things as waiting times and queue lengths 
can begin. Many simulation models of networks of queues 
have been reported. The interested reader is referred to 
Cochi [26], Browne, Lan and Baskett [14], Cheng [24], Gaver 
and Shedler [41], Lavenberg and Shedler [76], and Querubin 
and Bamamoorthy [89]. This list of references, neither 
exhaustive ncr representative, consists of examples of 
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models of ccmputer systems. 



This thesis is concerned with the problems associated 
with development and numerical solution of analytic models. 
Therefore, reference will seldom be made to simulation 
efforts such as those cited above. The analytic method 
utilized throughout this thesis is an embedded Harkov chain 
approach which leads to a system of linear balance 
equations. These equations can be solved for the 
steady-state probability of finding the system in each 
state. appropriate weighted averages of these steady-state 
probabilities then reveal the desired measures of system 
performance. (See Kendall [65] for a discussion of this 
technique.) This technique has proved to be the most 
fruitful method available for deriving numerical results for 
queuing problems. Many of the references cited in the 
remainder of this chapter make use of this technique. 

Another method, one which will not be pursued here, is 
the use of diffusion approximations. The reader who is 
interested in the application of diffusion techniques to 
queuing networks is referred to Gaver [37], Gaver and 
Shedler [42], Gelenbe [45], Kobayashi [72,73] and Reiser and 
Kobayashi [ 94 ]. 

In the simplest of queuing models, a state can be 
conveniently represented by a single number. For example, 
consider a single queue having finite capacity, N. Any 
customer, or job, arriving at a time when there are N jobs 
at the queue, enqueued and in service, is immediately turned 
away and lost to the system. The states of the system are 
conveniently numbered 0, 1, 2, ..., N, where the number of 
each state is the number of jobs at the queue when the 
system is in that state. 

As the complexity of the model increases (for example. 
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more queues, different job types, more complicated arr 
and service schemes), so does the complexity of the st 
description and usually the number of states. A vecto 
representation is a necessary state description in man 
these more ccmplicated cases. Many examples will be g 
in Chapter II. Also, many of the references cited in 
final section of this chapter use vector representa tio 
However, the use of a vector representation produces s 
complications in solution procedures. For example, ho 
consideration of all states without possible duplicati 
made? When using a computer to arrive at a solution 
{usually a necessity as the state spaces become large) 
can the state descriptions and the balance equations b 
efficiently stored? 
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2. PREVIEW OE THE THESIS 



In recent years the discovery of whar has come to be 
called a product-form solution for certain classes of 
problems (see subsection 5.3) has greatly simplified the 
solution of such problems by making it unnecessary to store 
and solve the balance equations. For example, if A is any 
state, then the steady-state probability of finding the 
system in state A is found to be 



( 1 . 1 ) 



P (A) = c TT f. (A) 
1 



where C is 
= 1 to i = 

the system 



a normalization constant, the product runs from i 

the number of queues, and f is dependent upon 

i 

parameters and the character of the job stream at 



queue i, but not upon 
still necessary to be 



A in any other way. Note that it is 
able to consider all of the states 
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without duplication. 



A large class cf problems are not known to have a 
product form solution (or some other simple solution form) . 
These are considered by many to be unwieldy because of the 
type of analytic complications mentioned above. In Chapter 
II the complications resulting from vector representations 
are addressed and techniques are presented for finding 
numerical solutions for a wide variety of cases. 

As previously remarked, the primary motivation behind 
this thesis involves modelling of multiprogramming computer 
systems as networks of queues. Although many of the 
techniques discussed here are applicable to a more general 
class of network models, the emphasis is upon the 
central-server models (see Figure 1.1) , of which Baskett and 
Palacios [7] have said 

'•The central server queuing model seems to be an 
excellent model of multiprogramming even though 
the assumptions appear to be violated..." 

The physical properxies of a modern computer system 
limit the number of jobs which can be considered for 
processing at any given moment. Thus, even though there may 
be many jobs waiting for processing in an external "input 
queue," the computer itself might appropriately be modelled 
as a closed network of queues. 

Given such a model, the aarkov chain approach discussed 
in the next section can only be applied if a state space can 
be discovered such that the time between transitions among 
the states is exponentially distributed with a parameter 
which depends only upon the state the system is in during 
that time. Using vectors as state descriptors leads to such 
state spaces for a large class of models. Examples of 
vector state spaces used in this manner are found in many of 
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Figure l.l--Closed Central -Server Network of Queues 
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the references given in this chapter, as well as throughout 
the thesis. Once a state space has been determined, it is a 
relatively routine matter to write down the balance 
equations resulting from the Markovian assumptions. In 
principle the balance equations may then be solved for the 
steady-state probability associated with each state, and 
comp 9 site measures of system performance may be calculated 
by taking weighted sums of the state probabilities. 

However, the complexity of many interesting models 
results in a state space whose size precludes "by-hand” 
solution. In solving such problems on a high-speed digital 
computer, consideration must be given to the questions of 
representation, generation and storage of the states and the 
balance equations, as well as to the solution of the balance 
equations . 

The representation, generation and storage problems are 
addressed in Chapter II for a wide class of models for which 
a product-fora solution (or some other simple solution form) 
is not kncwn to exist. Each of these problems is considered 
first for a relatively simple central-server model (see 
Figure 1.1) with a single f irst-come-f irst-served 
exponential server at each queue and with two job types 
circulating among the queues. The changes necessitated by 
various changes in the model are then discussed. The model 
changes considered include introduction of generalized 
Erlangian service distributions, different queuing 
disciplines, multiple servers and capacities resulting in 
blocking (see section 4 of this chapter) . 

Methods for numerical solution of the balance equations 
are considered in Chapter III. In this chapter the cyclic 
nature of many problems is exploited. 

• Chapter IV reports on three computer programs which have 
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be€a written using the methodology discussed in Chapters II 
and III. This chapter also contains numerical results 
obtained by running these programs. These results may be 
used to compare the three models to each other as well as to 
a numerical example solved (using a product-form solution) 
by Baskett, Chandy, Muntz and Palacios [6]. Suggestions for 
approximating the models programmed by models which are 
amenable to a product-form solution may be tested using 
these programs. Some suggestions of this type are explored. 

Chapter V discusses a potential problem in gathering 
data on an actual computer system for comparison with the 
results of an analytic model. Certain data gathering 
techniques are found to be biased. Methods for estimating 
the biased results of these techniques using the results of 
the analytic model are discussed. 

In Chapter VI application of the work of Courtois [27] 
and others in decomposition of networks of queues is 
discussed for models which contain a central-server network 
as a submodel. In particular, the potential usefulness of 
solutions for the central-server submodel, as arrived at 
using either a producr-for m solution procedure or the 
techniques discussed in Chapters II and III, are considered. 
Two applications are discussed. 

Finally, Chapter VII summarizes the results of the 
earlier chapters and considers open questions and areas 
where further research needs to be done. 

Before proceeding with Chapter II, some of the concepts 
associated with the Markov chain approach to queuing network 
problems, as well as the major breakthroughs in applying 
this approach, are reviewed. By pointing out where these 
breaktnroughs cannot be applied, particular areas which 
require the techniques developed in Chapters II and III are 
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revealed. Tte next section deals with the Markov chain 
approach. Section 4 discusses the generalizations used in 
Chapter II as well as several of the references. The final 
section outlines the history of developments in solving 
Markovian queuing network problems and discusses the major 
breakthroughs. 



3. THE MAEKCV CHAIN APPROACH 



Networks of queues are generally classified as either 

open or closed. A closed network of queues is a collection 

of queues interconnected with a collection of paths. A 

fixed number of jobs, or customers, circulate among the 

queues alcng the paths. An example of such a network is 

represented in Figure 1.1. Upon completing service at one 

of the queues on the right (labelled Q,Q,...,Q),a job 

1 2 M 

progresses to the queue on the left, Q . Upon completion 

M+1 

of service at Q , the job progresses to Q with 

M+1 1 

probability p , to Q with probability p , and so forth. 

1 2 2 

Since the number of jobs circulating among the queues is 

fixed, p +p +... +p =1. 

1 2 M 

An open network of queues is similar to a closed network 

except that jobs can enter the network and depart from the 

network. Thus, the number of jobs need not be fixed. An 

example of such a nerwork is represented in Figure 1.2. In 

■chis example jobs can enter the network at any queue and 

depart the network from any queue. Upon completion of 

service at Q , for i = 1, 2, M, a job will proceed to 

i 
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Figure 1.2- -Open Central-Server Network of Queues 
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Q with prohability q or will leave the system with 
M + 1 i1 



probability q = 1 - q . Similarly, from Q a job will 
i2 i1 M+1 

proceed to Q with probability p or will leave the system 
i i 

with probability p =1-p -...-p. 

M+1 1 M 

The models depicted in Figures 1.1 and 1.2 are called 

central- server models, Q being called the central server. 

M+1 

This thesis will concentrate on applications of 
central-server models for which the central server 
represents a central processing unit (CPU), and the other 
servers, or queues, represent peripheral units (PP's) , such 
as tape drives, disc drives, and so forth. 

Consider now an arbitrary network of M queues. Suppose 

that a state space S has been chosen as descriptive of the 

possible states in which the system may be found as jobs 

(possibly) enter the system, move from queue to queue and 

(possibly) leave the system. Suppose further that S is 

discrete in the sense that the system remains in any state i 

in S for a positive time T before entering another state. 

i 

(That is, let T be the length of time from transition into 
i 

state i to the next transition out of state i. Then F_(0+) 

= 0 where F -(t) is the disrribution funcrion of the random 
i 

variable T .) If the distribution of T depends only upon 
i i 

the state i of the system during the time period, and if the 
probability g that from state i the system will next enter 

ij 

state j depends only on the states i and j, then the process 
is a semi-Markov process and a Markov chain is embedded in 
it. In this case, the future of the process is independent 
of the past if the current state of the system and the time 
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lie'll 





the system has 
since the last 
the time until 
therefore, not 
transition. 



been in that state are both known. The time 
transition is necessary, in general, since 
the next transition is a ''residual life" and, 
independent, of the time since the last 



However, if T is distributed exponentially, the time 
i 

until the next transition is distributed identically with T 

i 

(given the system is in state i) and is independent of the 
length of time the system has been in that state. In this 
case the process is a Markov process. The Komolgorov 
differential equations lead (assuming the existence of a 
steady-state distribution) to a system of linear balance 
equations of the form: 



(1.2) cP = 7 cgP 

i i j ji j 

where, for each i in S, P is the steady-state probability 

i 

that the system is in state i. (Note that there is an 

implicit assumption here that S has at most countably many 
elements. Otherwise, the sum is not well-defined. In most 
queuing applications exponential service times will 
guarantee countability of S.) An outline of the theoretical 
considerations leading to the development of such balance 
equations is given in Appendix A. 

Explicit examples of how a state space S may be chcsen 

in order to ensure that the times between transitions are 

exponentially distributed will be given throughout this 

thesis. In most cases the proper choice for the c 's and 

i 

q 's for use in (1.2) will be discussed. 

ij 
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4. GENEBfiLIZAIIONS FOR USE IN CHAPTER II 

In this section the generalizations used in Chapte 
are introduced. These include the generalizations of 
service distributions in subsection 4.1, the queuing 
disciplines in subsection 4.2, the number of servers a 
single queue in subsection 4.3 and the capacity of que 
subsection 4.4 The definitions in this section are 
essential tc a proper understanding of the remainder o 
thesis. 

4.1. Generali z^ Irian^ ian Service Distr iburi ons 

At first glance it may appear as though the de 
to find state spaces which ensure exponential times be 
transitions will force the exclusive use of exponentia 
service distribu ticns and Poisson arrival processes. 
However, the me thod of stages as introduced by Erlang 
and extended by Jensen [60] and Cox [28] provides the 
opportunity to use a class of non-exponential distribu 
with widely varying properties. 

Erlang suggested the use of, say, k stages of 
service life, with the times spent in each stage havin 
independent identical exponential distribution with ra 
parameter c. The distriburion of the resulting servic 
times has come to be called an Erlang distribution wit 
parameter c and shape parameter k. It is proportional 
Chi-squared distribution with 2k degrees of freedom. 

Such a distribution is generated by a queuing 
network such as that depicted in Figure 1.3. A job re 
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service at the first stage (Q^), immediately proceeds to the 

second stage (Q^) for further service, and so forth through 

the k-th stage (Q ) . The service received in each stage is 

k 

distributed exponentially with rate parameter c. The total 

service time fcr the job is measured from the time it enters 
the first stage until it leaves the last stage. 

From Figure 1 . 3 it may appear as though, in a 
network of queues, each queue with Erlangian servers cculd 
be replaced by a series of queues having exponential 
servers. However, doing so could be misleading. For 
example, consider Figure 1.3 as depicting a network of k 
queues, and suppose that there are m jobs in the network. 
Then, in general, the m jobs could be distributed among the 
k queues in any manner whatsoever, and there would be a job 
(possibly more than one) in service at each queue which is 
not empty. Now consider Figure 1.3 as a representation of 
the k stages of an Erlang service distribution at a given 
queue. Suppose that the given queue contains m jobs. If 
the queue has only one server, rhen only one of the m jobs 
can be in service. Although the job in service can be in 
any of the k stages, the other m - 1 jobs must be awaiting 

service at the first stage (Q ) . Thus, it is possible to 

1 

have at most one job among the K - 1 stages Q to Q , and if 

2 k 

there is cne among these stages, then there can be no job in 

service at Q . This is a definite contrast to the network 

1 

of k queues represented by Figure 1.3. 



Expansion of a queuing network to include the stages 
of service at each queue having an Erlang service 
distribution can cause complications. These complications 
are unnecessary since all that is required is that the stage 
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of service for each job experiencing an Erlang service time 
be included in the description of any state. 



Example 1 . 1 

Consider a network consisting of' a single queue with 
a single exponential server and a Poisson arrival process 
(i.e., the classical M/a/1 queue) . Then the states of the 
system can be represented by the nonnegative integers. That 
is, if S (t) is the state of the system at time t, the 
possible values for S (t) are the nonnegative integers: 0, 

1, 2, .../ where S (t) = i means that there are i jobs in the 

queue at time t. (If i > 0, exactly one of these jobs is in 

service.) Note that after being in state i for an 
exponentially distributed time, the system will enter either 
state i - 1 (if i # 0) or state i + 1. 

Now suppose that instead of exponential service, jobs 
at the queue experience Erlang service with shape parameter 
k = 2. Then the stare of the system at time t can be 

described by a pair of integers S (t) = {i,j} where i = 0, 1, 

2, ... represents the number of jobs in the queue and j = 

1,2 represents the stage of service of the job in service. 
If i = 0, let j = 1 by convention. Then the possible stares 
of the system are: {0,1}, {1,1}, {1,2}, {2,1}, {2,2}, 

{3,1}, ... If S(t) = {0,1}, then after an exponentially 

distributed length of time the system will transition into 
state {1,1}. If S (t) = {i,l} for i 0, then after an 
exponentially distributed length of rime the system will 
transition either to state {i+1,1}, by having another job 
arrive, or to state {i,2} , by having the job in service 
"complete the first stage of service and enter the second 
stage." Similarly, if S (t) = {i,2} for i # 0, then after an 

exponentially distributed length of rime the system will 
transition either into state {i-1,1}, by having the job in 
service complete service and leave the system, or inro state 
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{i+1,2}, ty having another job arrive. 

Now suppose that the service distribution is 
exponential but that the interarrival times have an Erlang 
distribution with shape parameter k = 2. Then the state of 
the system at time t can be represented by a pair of 
integers S (t) = {i,j}/ where i = 0, 1, 2 , ... represents the 

number of jots at the gueue and j = 1/ 2 represents, the 
stage of interarrival. This is most easily visualized as a 
two gueue series network in which the first gueue always 
contains an infinite number of jobs and has a single 
two-stage Erlang server. The second gueue is the gueue of 
the original system and has a single exponential server. 
Since the number of jobs at the first gueue never changes, 
there is no need to include this number in a state 
description. However, since there are two stages in the 
service distribution at this first gueue, any state 
description should indicate the stage of service of the job 
in service there. This is what is represented by the second 
component of the proposed state description. The possible 
states are; {0,1}, {0,2}, {1,1}, {1,2}, {2,1}, ... From 

state S(t) = {0,1}, the system will transition after an 
exponentially distributed length of time to state {0,2}. 

From state {0,2}, the system will transition after an 
exponentially distributed length of time to state {1,1}. 

This last transition corresponds to arrival of a job at the 
original gueue. From state {i,1} with i 0, the system 
will transition after an exponentially distributed length of 
time either to {i,2}, by having the interarrival time enter 
the second stage, or to {i- 1 , 1} , by having the job in 
service at the (original) gueue complete service and leave 
the system. Similarly, from state {1,2} with i 0 , the 
system will transition after a exponentially distributed 
length of time to either {i+1,1}, by having another jot 
arrive at the gueue, or {i-1,2}, by having the job in 
service at the gueue complete service and leave the system. 

The case in which both the service distribution and the 
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interarrival distribution have Erlang distributions re 
a three-ccmpcnent state descriptor. Two components ar 
the stage of the service distribution and the interarr 
distribution, and the other component is for the numbe 
jobs at the queue. 



guires 
€ for 
ival 
r of 



The conclusion that the times between transitions 
are exponentially distributed in Example 1.1 is based upon 
the well-kncwn fact that the minimum of two (or more) 
exponential random variables is exponentially distributed 
with rate parameter which is equal to the sum of the rate 
parameters of the distributions of rhe two random variables. 
(This is easily proved using standard probabilistic 
techniques. It is done, though not pointed out, in Barlow 
and Proshan [4]. It is done for the case that the random 
variables are independent and identically distributed in 
Johnson and Kotz [61].) Note that, even in the simple cases 
discussed in Example 1.1, the rate parameter of the 
distribution cf time between transitions depends upon the 
state of the system during that time — at least up to whether 
or not there is a job at the queue. 

Jensen [60] discussed a generalization of the method 
of stages in which the rate parameters are allowed to vary 
from stage tc stage. Referring again to Figure 1.3, this 
involves allowing c # c for i j, where the distribution 

i j 

of service at stage i is exponential with rate parameter c . 

i 



Cox [28] carried the method of stages zo its 
ultimate by showing that any life-time random variable whose 
moment generating function is a ratio of polynomials can be 
modelled as a network of exponential stages as depicted in 
Figure 1.4. (X is a life-time random variable if Prob {X>0} 

= 1.) In Figure 1.4 a job experiences a zero service time 
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Figure 1 . 4 - -Network Representation of a Generalized Erlangian Distribution 



(nc service) with probability 1 - p . 

1 



For j < k - a job 



experiences exactly j stages of service (each of which is 

exponentially distributed with possibly different, rate 

parameters) with probability p p ...p (1-p ). A job 

12 j j+1 

experiences all k stages of service with probability 



Now let X be a life-time random variable having 
distrinution function F(x) and density (if it exists) f(x) . 
Then the cicment generating function of X is the 
Laplace-Stielt jes transform of F(x): 



(1.3) 



F* (s) 




dF (X) 



or, if the density function exists, the Laplace transform of 
f U) : 



( 1 . 4 ) 



F* (s) = j 6 f (X) dx 

*’o 



(If f(x) exists, the two integrals are identical.) F* (s) is 
said to be a rational function if it is the ratio of two 
polynomials in s. The fact that F (x) is the distribution 
function of a life-time random variable forces the degree of 
the polyncmial in the numerator to be no larger than rhe 
degree, say k, of the polynomial in the denominator. As a 
result, r* {s) can be expanded in partial fractions as: 



(1.5) F(s) = (1-? ) + 2 P.P..,- • -P. (1“P. J TTc./(c.+s) 

1 2 1 1+1 J ] 

1=1 3=1 



= 0, and where c , c 
k + 1 1 2 



c are the k roots of 
k 



where p 



f 



9 • * * t 



the polyncmial in the denominator of F*(s) . The moment 
generating function of a random variable having the 

-cx 

exponential distribution function, G (x) = 1-e , is: 

(1.6) G*(s) = c/(c+s) 

As a result, (1.5) is found to be the moment generating 

function of a random variable whose distribution can be 

depicted in Figure 1.4 with exponentially distributed 

-stages. In particular, the distribution at stage Q is 

i 

exponential with rate parameter c . 



Note that since, in general, a k- degree polynomial 

cannot be guaranteed to have all of its roots on the 

negative real axis, it is conceivable that life-time random 

variables exist whose moment generating functions are given 

by (1.5) with complex valued p ’s and c *s. Indeed, Cox 

i i 

gives the following example in which two of the c *s are 
\ i 



complex. 



Example 1.2 

Consider a life-rime random variable X with 
probability density function 

22 2 -ax 

(1.7) f(x) = [a(a +b ) /b ] e (1-cos bx) 

for X > 0 and f(x) = 0 for x < 0. Then the moment 
generating function of X is 

2 2 2 2 
(1-8) F* (s) = [a(a + b ) ] / {(a+s)[ (a+s) + b ]} 
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which is in the general form given in (1.5) with p = p = 

1 2 

p^ = 1 . Note that the denominator of F*(s) has two complex 
roots, a + hi and a-^bi. 

In commenting on the use of complex probabilities, 
Cox points out that the decomposition into stages is an 
artificiality, and that there is no real concern about the 
probabilities and rates connected with the individual 
stages. The concern is rather about the process as a whole. 
That is, Erob {X < x} is a real probability even though the 
the probability of X "lasting** two but not three stages may 
formally be complex. 

Ncnetheless, the use of the method of stages with 

complex rates and probabilities leads to complex 

coefficients in the balance equations (see equations (1.2)) 

even though the variables, the steady-state probabilities, 

are assumed to be real numbers between zero and one. Much 

of the numerical analysis presented in Chapter III of this 

thesis requires real positive rates and nonnegative p *s . 

i 

For this reason a tacit assumption will persist throughout 

that the c *s are real and positive and that the p *s are 
i i 

probabilities. To wit, the definition: 

A random variable X is said to have a genera l iz ed 

distr ibut ion if it can be represented as 

a series of stages as depicted in Figure 1.4 with 

p =1, 0<p <1 for i = 2, 3, ..., k, and the 

1 i 

holding time at stage Q exponentially distributed 

i 

with rate c >0 for i = 1, 2, ..., k. 
i 

(The restriction p = 1 avoids zero service times.) 
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Several other representations of service times in 
terms of exponential stages are possible. A "parallel 
stages" representation, as depicted in Figure 1.5, is one 
example. (Ccx mentions that this case is also derivable in 
terms of a rational moment generating function so long as 
the roots of the denominator polynomial, or equivalently, 
the rates of the exponential stages, are distinct.) Another 
series representation would allow the service time to begin 
in any stage (see Figure 1.6). Still another representation 
would allow "movement" from any stage to any other stage. 

Cox shows that this last case is equivalent to the case 
considered in Figure 1.4 with the c *s and p_’s 



appropriately redefined. 



Although any of these alternative representations 
could be considered as easily as the one defined above, only 
the cases covered by the definition given will be explicitly 
considered in this thesis. The reason for this is 
threefold. First, distributions as depicted in Figure 1.4 
have been considered by other authors. (See Baskett [5], 
Baskett, Chandy, Muntz and Palacios [6], Litzler and Womack 
[78] and Moore [81].) Second, considering a variety of 
representations cannot add clarity to the discussion of 
generalized Erlangian service distributions later in this 
thesis. And third, once the development of the balance 
equations has been mastered for the present case, extension 
to these ether representations is straightforward. 



4.2. ^iigui £3 Disc ipli nes 

An important property of each queue is its queuing 
discipline; that is, the rule or set of rules which 
determine the order in which arriving jobs are served. The 
queues encountered by people in their daily lives usually 
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Figure 1 . 5- -Parallel Network Representation of a 
Generalized Erlangian Distribution 
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Generalized Erlangian Distribution 



have some variation of the f irst-come-f irst-served queuing 
discipline discussed here. In such queues it is often 
impractical or imprudent to depart too far from such a 
queuing discipline. 

However/ quite a variety of queuing disciplines are 
feasible for use at the processing units of a high speed 
multiprogrammed computer system. For this reason much work/ 
both theoretical and empirical/ has been undertaken in an 
effort to determine the relative advantages and 
disadvantages of various queuing disciplines. No attempt is 
made here to review the literature in this area. The 
interested reader is referred to Kleinrock/ Muntz and Hsu 
[71]/ Litzler and Womack [78] and Sherman/ Easkett and 
Browne [101]. In this subsection the queuing disciplines to 
be studied in Chapter II are introduced. 

At a queue having a f irst- co me-^fir st-s ep yed qu suing 
discipline (hereafter referred to as FCFS ) , the arriving 
jets enter service in the same order that they arrive at the 
queue. Cne of the best examples of a FCFS queue is the 
check-out counter at an old ma-and-pa corner grocery. 
Ignoring the lane switching phenomena, the check-out 
counters of a modern grocery store form a parallel 
connection of FCFS queues. A customer joins one of the 
lines and awaits his or her turn to check out. 

At a queue having a las t-come- first -served que ui ng 
di sci pl ine (hereafter referred to as LCFS) , an available 
server always serves the job which has been waiting for the 
shortest period of time. This is in contrast to the FCFS 
queues where service will always be extended to the job 
which has had the longest wait. At a LCFS queue a 
newly-arriving job will join the front of the waiting line 
if all servers are occupied. As described here, the LCFS 
queuing discipline is nonpreempt ive in that the service 
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received by a job cannot be interrupted. However, two 
variations cf preemptive LCFS queuing disciplines will be 
considered in Chapter II. 

At a queue having a preemptive LCFS queuing 
disciplire, a newly-arriving job interrupts the service of 
seme other job if no servers are available. The job which 
has been preempted in this manner then joins the waiting 
line, usually at the front, to await an available server. 

At a queue having a la st-come-first-serve d- 
preemp ti ve-r esume (LCF SPB ) d isciplin e , a job whose service 
has been interrupted resumes service at the point at which 
the last interruption took place. Thus, the service time 
from resumption of service (given no further interruption) 
has a residual life-time distribution. In the case that the 
(uninterrupted) service times are exponentially distributed, 
these residual life-tiroes are identically distributed with 
the uninterrupted service times. In the case that the 
service times have a generalized Erlangian service 
distribution (see subsection 4.1), the job resumes service 
in the stage cf service it last occupied. 

A natural companion to the LCFSPR discipline is the 
last-Gome-f i r St- se rv ed-pr eemptive- rep eat ILCFSPRpti, qu euing 
di scip li ne in which a job must "start from scratch" each 
time it receives service. Unfortunately, such a discipline 
destroys the renewal character of the queuing system, and it 
is this renewal character which permits the use of the 
embedded Markov chain approach discussed in section 3. The 
reason for this, as pointed out by Gaver [35], is that the 
service time of a job whose service was previously 
interrupted is constrained to be at least as large as the 
amount of service rime received prior to the interruption. 

As a rough approximation to such a situation, however, 
LCFSPHpt will be used to refer to a preemptive LCFS queuing 
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discipline in which a previously interrupted job is 
constrained to experience all stages of service previously 
entered, but not constrained to spend the amount of tiae in 
a stage that it previously spent there. For example, 
suppose a LCFSPRpt queue has a generalized Erlangian service 
distribution as depicted in Figure 1.4 with k = 5 stages. 

If the service of a job is interrupted while it is in the 
third stage (Q^) , then that job is constrained to experience 

service "at” Q , Q and Q (unless again interrupted before 
1 2 3 

"entering” Q or Q ) each time it receives service, until it 
2 3 

finally completes its service time at that queue. In effect 

p , p and p are each restricted to be equal to one for 
1 2 3 

this job until its service time is complete and it leaves 

the queue. But each time it enters Q , for example, ix 

1 

receives a different (exponentially distributed) service 

time there. So defined, LCFSPRpt is identical to LCFSPR if 
the service times are exponentially distributed. 

A queue is said to have a c lass pr iority queu in g 
d iscipli ne, or simply a prio ri ty qu euing disc iplin e for 
short, if different classes of jobs enter the queue (or 
entering jobs are assigned to different classes) , and jobs 
in some class always receive service before jobs in some 
other class. Such disciplines exist in military life and 
fall under the category of "rank has its privileges." For 
example, if an admiral and an ensign are both waiting for a 
haircut in a military barber shop, the admiral will be 
served first, even if the ensign arrived first. In computer 
systems it may be desirable to give short-running jobs 
priority ever longer-running jobs at a central processing 
unit. This example points to the fact that differenx 
classes of jebs may have different cha racteristics at the 
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queue, for example, different service disciplines. 

As with the LCFS disciplines, priority queuing 
disciplines may be either preemptive or nonpreempt ive. The 
comments concerning LCFSPR and LCFSPHpt disciplines apply in 
this regard. Whether preemptive or nonpreemptive , if the 
queuing discipline is priority, the waiting jobs with 
highest priority will be at the front of the line; and those 
with the lowest priority will be at the back. 

Many priority schemes have been devised. Indeed, 
different computer manufacturers recommend different 
schemes, and different computer facilities having the same 
computer system often employ different schemes. In general, 
the techniques to be studied in Chapter II can be applied if 
the class, or priority, of a job does not change once it 
enters a queue. These techniques can also be applied in 
some cases in which priority changes take place. However, 
in other cases, such as the interesting ones suggested by 
Jackson [57] and Kleinrock and Finkelstein [69], the renewal 
character of the system is disturbed and the techniques 
cannot be applied. 

Many articles have been written about the so-called 
ro und- ro bin (HR) queu ing dis cipli nes in which each job at 
the queue is, in succession, given a burst of service cf 
fixed length q (or less if less is required to complete the 
service of the job) . Among the authors which compare the RR 
disciplines to other disciplines are Kleinrock [67], 
Kleinrock and Muntz [70] and Rasch [90]. The advantage of 
such a discipline is that short-running jobs do not wait in 
queues for excessive periods of time while long-running jobs 
are receiving service, and long-running jobs are not 
enqueued for excessive periods of time while short- running 
jots, given a higher priority, return to the queue time and 
time again keeping them at the back of the line. The size 
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of g should fce kept short enough to allow short-running jobs 
to pass through the queue in a reasonable time, and yet long 
enough to prevent system overhead (time spent shifting from 
job to job rather than working on jobs) from becoming 
excessive. System overhead is considered in the models by 
Heacox and Purdom [51]. Some of the literature on R8 
disciplines is concerned with ‘'optimal" choice of q. 

Unfortunately, the fixed length, q, of the service 
burst disturbs the renewal character of models of such 
queues. This makes a successful embedded Markov chain 
approach based upon a conxinuous time Markov process 
unlikely. Using some simplifying assumptions concerning the 
relationship between g and the service time distributions, a 
discrete time Markov chain approach can be used to analyze 
EE queues. Such an approach has been used by Bhandarkar 
[1C] and Easkett and Smith [8]. Discrete time parameter 
methods are not discussed in this thesis. The reader is 
referred to a series of articles by Neuts [82], Neuts and 
Klimko [83,84] and Neuts and Heimann [85]. 

However, totally ignoring system overhead and 
allowing the length of service burst g to approach zero, a 
"limiting" queuing discipline, referred to as processor 

results. One description of a PS discipline 
is: "In a period of length t, in which the number of jobs n 

at the queue does not change, each job receives service of 
length t/n; that is, each job receives (1/n)-th of the 
service effort of the queue." The effect of such a 
description is that if a job has service time distributed 
exponentially with rate c, its waiting and service time is 
distributed exponentially with rate nc so long as there are 
n jobs at the queue. It is in this context that the PS 
queuing discipline will be used in this thesis. 

A generalization of this concept is given by Kelly 
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£64]. O'Conovan [86] has suggested several models of a 
processor sharing gueue. A bibliography of models of 
processor sharing queues is contained in Babad [3]. 

4.3. Multiserver Qu eues 

Having discussed service distributions and queuing 
disciplines, consideration must now be given to another 
property of queues, the number of servers. Consider a queue 
which has m > 1 servers and one of the queuing disciplines, 
other than PS, discussed in the preceding subsection. Then, 
if there are n jobs at the gueue, x.he number of these jobs 
receiving service is n, if n < m, and m, if n > m. The 
number awaiting service is 0, if n < m, and n - m, if n > m. 

As pointed out above, the check-out counters of a 
large grocery store form a parallel system of PCFS queues. 

By way of contrast, some banks (and post offices) have a 
single waiting line for all tellers. The first customer in 
line moves to the next available teller. This is an example 
or a single multiserver FCFS gueue. 

Uhen the number of servers is at least as large as 
the maximum number of jobs which can be at the gueue at a 
time, the gueue has what is called an infinite ser ver (15) 

disc ipline . Note that this need not mean that there 
are an infinite number of servers if the gueue has a finite 
capacity, or if it is in a closed nerwork of queues. When 
the number of servers at a queue becomes (effectively) 
infinite, all queuing disciplines act the same way. This is 
why IS is considered to be a queuing discipline. Also, 
notice that a PS queuing discipline is an IS queuing 
discipline in which the service degrades each time another 
job arrives and improves each time one leaves. 
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On the other hand, a PS queue can also have multiple 
servers. For example, suppose a PS queue has m servers. 
Then, if there are n jobs at the queue, each job receives 
its full measure of service (as though the queue were an IS 
queue) if n < m, and (m/n) times its full measure of service 
if n > m. 



4.4. Capac ity a nd B lock ing 

One final property of a queue is its capacity, that 
is, the naximum number of jobs which can reside at the queue 
(both in service and enqueued) at any given time. In a 
single-queue system jobs which arrive when the queue is at 
capacity are turned away and lost to the system. Seme 
authors have incorporated rhis same idea into queuing 
networks . 



For example, Hannibalsson [49] examines a network of 

twe queues, as depicted in Figure 1.7, both having finite 

capacity. A newly arriving job enters the first queue (Q ) 

1 

if there are fewer than N jobs (the capacity of Q^) enqueued 

and in service there. If N jobs are at Q when a job 

1 

arrives, the job immediately leaves the system. (The inpur 

process is Poisson and both queues have FCFS exponential 
servers.) A job completing service at leaves the system 

with probability (1 - p) or proceeds to the second queue 



(Q^) with probability p. 

of Q ) already at Q when 
2 2 

system. A job completing 



If it finds M jobs (the capacity 

it arrives, it will leave the 

service at Q will return to Q 
2 1 
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where it will be treated as though it were a newly arriving 



jot . 



The advantage of imposing capacities on queues/ or 
cn a queuing network as a whole, is that doing so very often 
yields a finite state space. For example, in Hannibalsson' s 
model' (Figure 1.7) the states of the system can be 
described in terms of ordered pairs (n,m) of nonnegative 
integers such that n is no larger than N and m is nc larger 
than H. Thus, the number of states which need be considered 
is no larger than (N+1) x (a+1). 

In closed networks of queues this same advantage is 
enjoyed. The number of states is finite (in many cases) 
since the number of jobs is fixed. Reiser and Kobayashi 
[93] discusses a "semi-closed" network of queues in which 
the total number of jobs is constrained to be no larger than 
k" and no smaller than k'. A job is turned away if it 
attempts to enter the system when k" jobs are already there. 
Similarly, a job is not allowed to leave the system if there 
are k* jobs in the system (including it). These 
restrictions lead to a finite state space. 

Since a treatment such as that of Hannibalsson, in 
which any job encountering a queue an capacity leaves the 
system, is net usable in a closed network of queues, 
additional ways of handling queues with finite capacity in a 
network are needed. One possibility is to allow a job to 
"skip" any queue which is at capacity. For example, 
consider the closed network depicted in Figure 1.1 with N 
jobs circulating in the system. Further, supoose rhat Q 

M+ 1 

has a capacity of n < N jobs. Any job completing service 

ax, say, Q when there are n jobs at Q will skip Q , 

1 M+1 M+1 
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experiencing neither waiting time nor service time, and 

proceed immediately to with probability p^, with 

probability p , and so forth. If one or more of Q , Q , 

2 12 

Q are also at capacity, the analysis becomes a little 
M 

more complicated, but the idea is the same. The "service 

deletion" mechanism of Jackson [61] is equivalent to 
skipping as described here. 

Skipping seems a little arbitrary. However, another 

alternative exists, that of blocking. Again consider the 

example introduced in the preceding paragraph, except that 

jobs are not allowed to skip. Then a job completing service 

at Q when there are n jobs at Q is blocked from entering 
1 M + 1 

Q until a job leaves Q . Hence, it must remain at Q . 
ia+1 M+1 1 

In turn, the blocked job continues to occupy a server (the 

server, if there is only one) at Q^, thereby blocking jobs 

enqueued at Q from receiving service. (If not, Q could 
1 1 

serve as an "overflow waiting room" for Q and, in effect, 

M+ 1 

the capacity of Q would be increased.) A simple 

H + 1 

twc-queue model with blocking has been discussed by Adiri, 
Hofri and Yadin [1j. 

A prcblem which can arise in networks with blocking 
is deadlocking. Again consider the example discussed above. 
If is also at capacity and has a single server, no job 

can receive service at Q until a job leaves Q , 

1 M+1 

unblocking the job at Q , and allowing it to proceed tc 



47 



also has a single server, and that 



Q . Suppose that Q 
M+1 M+1 

a job completes service ar Q and is destined for Q . 

M+1 1 

Since is at capacity, this job is blocked and blocks any 

further service activity at Q . Thus, the two queues are 

M+1 

blocking each ether and no service activity can take place 
in either. This is called deadlocking. 

Gordon and Newell [^*7] discussed a closed cyclic 
network cf queues (jobs leaving queue i proceed only tc 
queue i + 1; jobs leaving the last queue proceed only to the 
first queue) in which one or more queues have finite 
capacity and blocking results. However, as formulated by 
Gordon and Newel, a blocked job must repeat service while 
waiting tc be unblocked. Hence, unlike the blocking model 
discussed above, only one job can move between queues at any 
instant. Results are given only for some special cases of 
the cyclic model discussed. This and the fact that very few 
authors address blocking bear testimony to the analyric 
difficulty of the subject. 

Nonetheless, models involving blocking and queues 
having finite capacity are appropriate in the computer 
world. Deadlocking, for example, is discussed by Havender 
[50] as a real problem in multiprogr ammed computers. In 
Chapter II some space will be devoted to modelling in cases 
involving blocking and queues with finite capacity. 

5. REVIEW OE QUEUING NETWORK LITERATURE 



This section consists of a review of the major papers on 
the subject cf queuing networks, particularly Markovian 
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queuing networks. No attempt has been made to be exhaustive 
in this review. Indeed, in many cases where some particular 
work summarizes and extends earlier works, only the most 
recent (or, in some sense, the most complete) is cited. 

These interested in extensive research into this subject 
could use the references in this section and earlier 
sections as a starting point and investigate the references 
in the cited works for further depth and background. 

E. A. Torbett [105] in an appendix to his PhD thesis has 
an excellent review of some of the early research in queuing 
networks. The first subsection of this section parallels 
Torbett's appendix, although the emphasis here differs 
somewhat from that in Torbett's work. 

The remaining subsections are arranged mere by problem 
approach than chronologically. A very good review of this 
material has recently been published by Lemoine [77]. 

5.1. fac 1^ Wo rk 

One of the most important features of any network of 
queues is that the output stream from certain of the queues 
contributes to the input stream at certain other queues. As 
a result it is hoped that known results concerning 
individual queues will be applicable in studying networks of 
such queues. In this regard it is apparent that some 
knowledge of the departure process of an individual queue is 
required . 

Burke [15] took the first step in setting the stage 
for the study of Markovian queuing networks. He 
investigated the departure process from a single FC?S queue 
with multiple (identical) exponential servers given a 
Poisson arrival process. Assuming that the traffic 
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intensity (the ratio of the arrival rate to the maximuai 
service rate) is less than one, so that steady-state exists, 
Burke proved that the steady-state departure process is 
identical to the Poisson arrival process and independent of 
the gueue size left by a departing job. Since the 
steady-state distribution of the gueue size for the M/M/s 
queue is well-known (see Kleinrock [68], for example), the 
implicaticn of this result to an open network consisting of 
a series connection of such queues is immediate. 

Consider a series connection of N queues. For each 

i = 1, 2, ..., N, suppose that gueue i has s identical 

i 

exponential servers. Suppose further that jobs enter the 

system in a Poisson arrival stream at the first queue and 

depart from the system in a departure stream from the M-th 

gueue after experiencing service at each queue in 

succession. Also suppose that the traffic intensity at each 

queue is less than one if the rate of the Poisson arrival 

stream is used in each calculation. Burke's results show 

that the input process for each of the N queues is identical 

to the input process to the system. Thus, the i-th gueue is 

an M/tS/s gueue with known interarrival and service 
i 

distributions. The independence proved by Burke indicates 

that the joint steady-state probability of finding n_ jobs 

at queue i for i = 1, 2, ..., N is the product over i of the 

steady-state probabilities of finding n jobs at queue i, 

i 

considered as an individual a/M/s queue. This result was 

i 

displayed by R. fi. P. Jackson [59]. 



Since the superposition of independent Poisson 
processes is a Poisson process, this result extends easily 



50 



tc networks in which several (or all) of the queues 
experience arrivals frcm outside the network. The fact that 
prchabilist ic filtering of a Poisson process results in a 
Poisson process facilitates extension to nonserial networks 
so long as there is no feedback. (A queuing network has 
feedback if paths exist which could allow a job to return to 
a queue at which it has already received service.) 

Indeed, even feedback can be accommodated, except 
that the- rate of the input process to some of the queues is 
not so readily ascertainable. J. R. Jackson [56] considered 
a rather general open network of N queues in which, for i = 

1, ..., N, 

1. Queue i has s identical exponential servers. 

i 

2. Jobs from outside the network arrive at queue 

i in a Pcisson stream with rate r . 

i 

3. All queuing disciplines are FCFS. 

4. Jobs departing queue i route (instantaneously) 
to queue j with probability p or leave the 

ij 

system with probability 1 - (p + p + ... + 

i1 i2 

P.„) • 

iN 

Noting that the arrival process to any given queue is the 

superposition cf arrivals from outside the system and 

arrivals frcm each of the N queues within the system, 

Jackson defined the average arrival rate of jobs at queue i 

to be R and stated that in steady-state 
i 



(1.9) 



N 



R = r + 2 P 

i i . , ji : 
3 = 1 



He then proceeded to prove that the joint steady-state 
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probability of finding n jobs at queue i for i = 1, 2 , 

i 

N is the product over i of the steady-state probability of 

finding n jobs at an M/M/s queue with arrival rate R and 
i i i 

service rate equal to the service rate at queue i. This is 

exactly the result given by R. R. P. Jackson, but with the 

arrival rates determined by solving the system of linear 

equations (1.9). Note that, with r =0 and the sum over j 

i 

of p equal to one, Jackson's model is a closed network of 

i j 

queues. This offers some hope that a product-form solution 
may also be valid for closed networks. 

Burke proved his results (see the beginning of this 
subsection) under the assumptions of an homogeneous Poisson 
arrival process, identical exponential servers, an infinite 
capacity and identical jobs. With reference to the 
discussion in the preceding section, it is natural to ask 
questions concerning extensions of Burke's results. For 
example, what can be said about the departure process if the 
rate of the arrival process depends upon the number of jobs 
at the queue; or if the rate of service is a more general 
function of the number of jobs; or if the queue has a finite 
capacity; or if the interarrival or service times have 
generalized Erlangian distributions; or if arriving jots are 
of different classes, with the arrival process and service 
distributions differing with class? Few of these questions 
have been answered. 

Reich [91] extended Burke's work to show that the 
service time distribution must be either exponential or a 
step function at zero (thar is, no service time at all) if 
both input and output processes are to be Poisson. Finch 
[34] showed that, in order for the departure process to be 
independent of the queue size left by a departing job, the 
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service tines must be exponential and the queue cannot have 
finite capacity. Beich also showed that, if the 
interarrival times and service times both have two-stage 
Erlang distributions, the output process of a single server 
queue is not identical to the input process. These results 
hold forth little hope for finding product-form solutions 
fcr networks of queues incorporating some of the 
generalizations discussed in the preceding section. 

However, it must be noted that. H. R. P. Jackson, 
being most piobably unaware of Burke's result which was 
published later than his own, did not use Burke's result. 
Furthermore, although J. B. Jackson acknowledged both 
Burke's paper and Reich's, he used the same solution method 
as did E. B. P. Jackson. Both authors derived the 
equilibrium balance equations in the usual manner (see 
Appendix A) , stated the solution and proved that the 
solution satisfied the equations by simple substitution. 

This same method was used by Koenigsberg [74], who 
was one of the first to exhibit a product-form solution for 
a closed network of queues, and by Finch [33], whose model 
included a ncnhomogeneous Poisson arrival process. 
Koenigsberg ' s model, termed a cyclic 3 ueue, consisted of a 
series of queues in which jobs leaving the last queue 
entered tte first queue. This is a submodel of one of the 
two models considered by Finch. Finch's models, both of 
which were open networks, incorporated simple forms of 
feedback. Jobs arrived at the network in an homogeneous 
Poisson arrival stream, bur were turned away whenever the 
network contained a specified number of jobs (the capacity 
of the network) . In this way Finch used a ncnhomogeneous 
Poisson arrival process to maintain a finite state space. 

Gordon and Newell [46] showed that, because of the 
form of the ncnhomogeneous Poisson arrival process used. 



53 



Finch's mcdels are equivalent to closed queuing network 
models ccrtaining one additional queue. This queue has a 
single exponential server with service rate equal to the 
rate of the Ecisson arrival process of the equivalent open 
model. At anj instant the number of jobs at this auxiliary 
queue may be determined by subtracting the number of jobs in 
the remainder of the network from the network capacity. 

Thus, when the network is at capacity, there are no jots in 
the auxiliary queue and there can be no "arrivals." On the 
other hand, when the network is below capacity, there is a 
job in service at the auxiliary queue and "arrivals" occur 
at the appropriate rate. 

In this same paper Gordon and Newell went on tc 

consider an arbitrary closed network of M queues with N 

identical jots circulating in it. For each i = 1, 2, ..., 

M, queue i is assumed to have an arbitrary number, r , of 

i 

identical exponential servers. Indeed, their model is 

exactly the model of J. R. Jackson discussed earlier with 
exogenous arrival rates all set equal to zero and with the 
sum over j of p equal to one. After deriving the balance 

ij 

equations, they assumed a product-form solution and proved 

that the jcint steady-state probability of finding n jobs 

i 

in queue i for i = 1, 2, ...» M is given by 



( 1 . 10 ) 



-1 n; 

F(n ,n ,...,n ) = G (N) ? x ' /b (n ) 

12 H “ i i i 

i=1 



ifn +n + ... +n =N, andP(n,n,...,n) =0 

12 li ^ 2 a 

-1 

otherwise. In (1.10) G (N) is a normalizing constant 
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which depends upon the number of jobs in the network. 



(1.11) b.(n.) 

1 X 




if n < r 
i i 



if n > r 
i i 



and X is prcpcrtional to the probabiliry of finding a job 
i 

in queue i in the special case that N = 1. The x can be 

i 

found by solving the (usually much smaller) system of 
equations; 



( 1 . 12 ) 



M 

m X = 2 m p X 

i i . , j ji j 



where m is the rate of service for a single server at queue 
i 

i and p is the probability that a job leaving queue j will 

ji 

next enter queue i. 



Gordon and Newell's model is a special case of a 
very general model considered in an earlier work by J. R. 
Jackson [58], of which Gordon and Newell were apparently 
unaware. Jackson's new model consisted of an arbitrary 
network of queues similar to that considered in his earlier 
work (in [56]) discussed earlier in this section. Unlike 
his earlier model, however, this new model assumed jobs 
entered the network in a Poisson arrival stream whose rate 
depended upcn the total number of jobs in the network. The 
choice of queue at which to enter was made in a Markovian 
manner according to a fixed probability distribution. 
Service at the queues was exponential with a rate which 
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I 



depended not only on the queue, but also on the number of 
jots at the queue. 

In addition, Jackson incorporated two mechanisms, a 
'•triggered arrival” mechanism and a "service deletion" 
mechanism. The triggered arrival mechanism provided for a 
lower -bound on the total number of jobs in the network. 

When the number of jobs in the network equaled this lower 
bound, a departing job was immediately reinserted into the 
network as a newly arriving job. 

The service deletion mechanism provided for finite 
capacity at individual queues. This was accomplished hy 
automatically ejecting the job in service at a queue 
whenever a new job arrived and found the queue at capacity. 
Thus ejected, the job proceeded to another queue or left the 
network as though it had just completed its service. As 
pointed cut in subsection 4.4 of this chapter, this is 
equivalent to a job bypassing any queue at capacity that it 
encounters. 

Jackson showed in [58] that the balance equations 
for the steady-state probabilities have a product-form 
solution. Like Gordon and Newell, he showed that the 
factors of this product-form solution can be determined by 
solving a considerably smaller set of linear equations. 
However, because of the generality of the model, Jackson's 
solution procedure is more difficult to apply than the 
similar procedure described by Gordon and Newell. For this 
reason it is Gordon and Newell's modal, rather than 
Jackson's, which has been widely examined and applied. 
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5.2. APFlications to Com£u te r Systems 

Spencer and Sheng [103] have reported using 
Jackson's earlier model in a study of multiprogrammed 
time-sharing computer systems. Avi-Itzhak and Heyman [2], 
Buzen [17], Chiu, Durmcnt and Wood [25] and Kaneko [621 have 
used the results in Gordon and Newell's paper [46] to 
investigate various aspects of a multiprogrammed computer 
system. Torhett [105] used Gordon and Newell's result in an 
examination cf optimal control of closed queuing networks 
with adjustahle service rates. 

Gaver [38] modelled a multiprogrammed computer 
system as a closed central server network of single server 
queues as depicted in Figure 1,1. He then showed that, for 
this model, Gordon and Newell's result (1.10) reduces to 

M 

-1 n: 

(1.13) P (n^,n^,.. . = G (N) TTx^' 

i=1 



where for each i = 1, 2, ..., M, 

(1.14) X = m p/m 

i M+ 1 i i 

where m is the exponential service rate at Q and p is as 
i i i 

in Figure 1.1. He then used this formulaticn to derive 

-1 

G (N) and several measures of system performance, such as 

the idleness probability for each of the queues. The 
solution given by (1. 13) will be used in later chapters for 
ccmpariscn with nuaerical results using the methodology 
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developed in Chapter II. 



In principle, the value of G (N) in (1.10) can be 
determined hj appealing to the normality condition, which 
says that the sum of steady-state probabilities is unity. 

That is, once x has been determined from (1.12) , 

i 



G(N)P(n ,...,n ) can be calculated (see (1.10)) for each 
1 M 

state. These quantities are then added over all stares to 



yield G (N) . Unfortunately, as N and M become large, the 
total number of states becomes large combinatorially . This 
forces the use of digital computers. Special pains must be 
taken to see that too much is net lost due to round off. 
This problem is discussed in Dorn and KcCracken £291. 



Euzen £18] studied Gordon and Newell's product-form 
result for closed queuing networks and developed an 

algorithm for determining G (N) using the x 's from (1.12). 

i 

Intermediate results are G(1), G(2), ..., G(N-1). He then 

went on ro display composite measures, such as the marginal 

distributions, P{n = k}, in terms of G(1), G(2), ..., G(N). 

i 

This algorithm solved some of the computational problems 
facing those who wish to use Gordon and Newell's results. A 
good review of this algorithm and several extensions have 
been giver by Bhandiwad and Williams £11]. 



5.3. Local Bala nce and Quasi- reversi b ility 



In the same year that Gordon and Newell's much-cited 
work £46] was published, ?. Whittle [109] presented the same 
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results at the thirty-sixth session of the International 
Statistical Institute. The next year an article by Whittle 
[110] was published in which he displayed a product-form 
solution for open queuing networks, reproducing the results 
of Jackscn [58] (of which he, too, was apparently unaware). 
While Jackson's work was remarkable for its generality, and 
Gordon and Newell's for its simplicity, the impact of 
Whittle's work has been the introduction of the concept of 
loc^l balance, also referred to as indiv id ual bala n ce and 
ba la n c e. 

The balance equations considered in earlier works 
(hereafter referred to as global balance equations tc 
differentiate them from the local balance equations) equate 
the weighted rate of flow from a given state to the weighted 
rate of flow into that state. The weights are merely the 
steady-stare probabilities of finding the system in the 
appropriate state. For example, the global balance 
equations shewn by Gordon and Newell to have (1.10) as their 
solution are 

M 

(1.15) { 2 min (n ,r )m } P (n ,...,n ) = 

i i i 1 M 

i=1 

M 

2 {min (n + 1 ,r ) m p } 
i i i ij 

j = 1 

xP(n ,...,n — 1,...,n ^1,...,n ) 

1 j i M 

where m is the rate of the exponential service times at 
i 

queue i, r is the number of servers at queue i, and p is 

i ij 

the probability that a job completing service at queue i 
will next route to queue j. 



M 

2 

i=1 
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The local balance equations equate the weighted rate 
(same wieqhts) of flow from a state due to departure from a 
given queue to the weighted rate of flow into the state due 
to an arrival at the same queue. The local balance 
equations corresponding to (1.15) are 

(1.16) • ain (n ,r )m P (n ,...,n ) = 

i i i 1 M 

M 

y {min (n + 1 ,r ) m p } 

3 3 j 3i 

3=1 

xP(n ,...,n +1,...,n -1,...,n ) 

1 \ j i a 

for each i = 1, 2, ..., M such that n #0. 

i 

Summing (1.16) over i yields (1.15). That is, if a 
candidate solution satisfies the local balance equations, it 
also satisfies the global balance equations. Whittle showed 
that a given product form is the solution to the appropriate 
set of global balance equations by showing that it is tne 
solution tc the corresponding set of local balance 
equations. This same technique has been used by several 
authors since to prove that a product form solution exists 
for a variety of network problems. 

Combining and extending the results of several 
papers, Baskett, Chandy, Muntz and Palacios [6] considered a 
general queuing network as is now described. The number of 
queues is arbitrary, but finite. There are an arbitrary, 
but finite, number of job types. The probability that a job 
of type r completing service at queue i will next enter 
queue j as a job or type s is fixed. The system may be 
either open or closed. If open, arrivals to the network 
occur in a Poisson arrival stream with rate dependent upon 
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the total numher of jofcs in the system, or in a collection 
of Poisson arrival streams, one for each ergodic subchain of 
the Markovian switching matrix, with the rate of each stream 
dependent upon the number of jobs in the corresponding 
subchain. An arrival enters the system at a given queue as 
a given type job with fixed probability. Each queue is one 
of the following four types: 

Type A FCfS queue with exponential service 

having rate dependent upon the number of jobs in 
the queue, but not their types. 

2: A single server PS queue with generalized 

Erlangian service, dependent upon job type. 

IS queue with generalized Erlangian 
service, dependent upon job type. 

T yp e 4: A single server LCFSPa queue with 

generalized Erlangian service, dependent upon job 
type. 

Using the local balance technique of Whittle, 
Baskett, et. al. showed the steady-state solution to have 
the product form; 

M 

(1.17) P {S = ( 2 . ,. ..,n ) } = c d (S) TT f. (n. ) 

X H XX 

i = 1 

where M is the number of queues, n is a vector whose 

i 

contents and structure depend upon the type of queue i, c is 

a normalization constant, i(S) is a constant which depends 

upon the form of the arrival process (d(S) = 1 if the system 

is closed) , and each f (n ) is a product whose form depends 

i i 

upon the type of queue i as well as the contents of n . 

i 
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Chandy, Keller and Browne [22] have reported the 
development c£ a computer program, referred to as ASQ (for 
"algebraic scluticn for queues") , which provides both 
numerical and algebraic solutions to many types of queuing 
network models. The models which are amenable to analysis 
using ASQ are submodels of the general model of Baskett, 
Chandy> Muntz and Ealacios. Recently Reiser [92] has 
reported the development of an interactive computer program, 
referred to as QNET4, which provides numerical solution to a 
slightly broader class of models. 

Computation simplifying techniques and algorithms, 
some of which are extensions of Buzen's work [18], have been 
proposed for many of the product form solutions proved by 
Baskett, et. al. Some of these proposals have come from 
Bhandiwad and Williams [11,112], Hine and Fitzwater [52] and 
Reiser and Kcbayashi [95,96]. 

A comprehensive treatment of local balance has been 
given by Kingman [66], who also discusses a stronger 
property called reve rsi bil it y . A Markovian queuing network 
is said to be reversibile if the (unconditional) rate of 
transition from a state C to any other state D is equal to 
the rate of transition from D to C. Letting q{C,D) be the 
conditional rate of transition from C to D, and letting p (C) 
be the steady-state probability associated with state C, the 
reversibility condition is 

(1.18) F(C)q(C,D) = p(C)q(D,C) 

for each fair of states C and D. Reviewing the definition 
of local balance, it is evident that every reversible system 
satisfies the conditions of local balance. Kingman proved 
that a product form solution exists for each irreducible 
reversible sjstem. 
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It is easily seen that local balance is not 
equivalent to reversibility. The tandem queuing models of 
fi. E. P. Jackson £59], which are discussed in subsection 
5.1, are examples of irreversible systems which satisfy the 
conditions cf local balance. Thus, the work of Whittle and 
others cited above show that reducibility , though 
sufficient, is not necessary for the existence of a product- 
form solution. 

The wide use of the local-balance technique for 
proving a product-form solution, and the lack of proofs 
based on weaker criteria, lead naturally to a conjecture 
that local balance is necessary and sufficient for existence 
of a product-form solution. However, neither necessity nor 
sufficiency has been proved. 

Very recently Kelly [64] proved the existence cf a 
product-form solution for an open network of queues with 
multiple jcb types using a property he terms 
qua si- re versib ilit y. Supposing a Poisson arrival process 
and classification of departing jobs into groups according 
tc the jet's experience in the network (for example, its 
service time) , Kelly defined a network to be 
quasi-reversitle if departures of the different groups of 
jots form independent Poisson processes and if the state of 
the network at any time is independent of departures up to 
that time. Although quasi- reversibility implies 
reversibility, the relationship between guasi-reversibilit y 
and local balance is unknown. 

Defining the conditional rate of departure from 
state C, q (C) , by ; 
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(1.19) g(C) = 2 g(C,D) 

D?tC 

Kelly proved (see Lemma 1 of [64]) that: 

If there is a set of positive numbers {p(C)} 
adding to unity, and a set of nonnegative numbers 
{g* (C,D)} such that 

(1.20) g(C) = 2 g'(C,D) 

D#C 



and 

(1.21) F(C)g(C,D) = p(D)g' (D,C) 

hold for all states C and D, then {p(C)) is the 
steady-state disxriburion which satisfies the 
global balance equations for the system. 

He then considered several open queuing network models in 
which an entering job was classified by type according to 
the route (finite and fixed as opposed to the Markovian 
branching probabilities discussed by other authors) it would 
take through the network and, in some cases, the service 
distribution it would experience at each queue it visited. 

In each model he then established the existence of a 
prcduct-f or m solution by developing the global balance 
equations, guessing values for p (C) and q'(C,D), proving 
that (1.20) and (1.21) are satisfied, and appealing to the 
lemma summarized above. The values guessed for p(C) and 
g'(C,D) were made more natural by the relationship between 
reversibility and guasi-reversibility . No further details 
will be given here. The interested reader is urged to read 
Kelly's very interesting paper [64]. 

Not only is the relationship between local balance 
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and guasi-reversibility not clear, but also the 
applicability cf the local- balance technique to Kelly's 
models, and the applicability of Kelly's quasi-reversibility 
techniques tc models whose product-form solution have been 
proved using the local-balance technique are not clear. For 
example, it is not known whether the guasi-reversibility 
techniques are applicable to closed queuing networks. The 
models considered in this thesis are more easily related to 
the models of Whittle and Baskett, Chandy, Muntz and 
Palacios than those of Kelly. 

In searching for a product-form solution based upon 
local balance, the form of the local balance equaricns often 
suggests the form of the solution. For example, since the 
sum over j of the p is equal to unity in the closed model 

ij 

of Gordon and Newell discussed above, dividing the right 
side of (1.16) by the left-hand side suggests that 

(1.22) E (n ,...,n +1,...,n -1,...,n ) = 

1 j i M 

{[min(n ,r )m p ] / [ min (n +1,r )m p ]} 
i i i ij 3 J j ji 

xP (n , . . . ,n ) 

1 M 

Assuming a product form such as 



(1. 23) p (n ,. . . ,n ) 

1 M 



M n , 

TT (Tt'‘ y . ) 

ik 

i=1 k=1 



leads to the following relationship 



(1.24) iiin (n + 1 , r ) p y 

j j ji + l 



min (n ,r ) ra p y 

i i i ij i,n 

i 



6S 



"factoring out" the dependence of y upon n by letting 

i 



( 1 . 25) X = min (n ,r ) y 

i i i i/A 



result? in 



(1.26) 



m p X 

d ji j 



m p X 
i ij i 



Summing (1.26) over j yields (1.12). The result (1.10) of 

Gordon and Newell’s analysis fellows directly, since b (n ) 

i i 

incorporates the relationship given in (1.25). 



However, it is not difficult to devise examples in 
which local balance does not hold. For example, Sauer and 
Chandy [93] consider a closed network consisting of twe FCFS 
queues, each having a single exponential server. Two jobs 
circulate in the network. The lack of local balance is a 
resulr of the fact that the two jobs have different service 
rates at cne of the queues. 

Since product form solutions seem to be closely 
related to local balance, attempts to determine steady-state 
solutions fer networks whxch do not satisfy the local- 
balance conditions either have been numerical in nature or 
have involved approximation by models which do satisfy the 
local-balance conditions. A brief summary cf such efforts 
is given in the next subsection. 



5.U. Num eri cal and App roxi matio n M ethod s 



Eased on Norton’s theorem in electrical circuit 
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theory, Chandy, Herzog and Woo [20] developed a technigue 
for examination of networks of queues in steady-state. This 
technigue, which is particularly suited to examination of 
the behavior cf a subsystem as its paramerers are varied, is 
best explained in terms of an example. So, consider Figure 
1.8. For simplicity suppose that N jobs of a single type 
circulate among the queues, that each queue has a single 
FCFS exponential server, and that the choice of routing to 

C , Q or Q is done in a Markovian manner according to a 
4 5 6 

fixed probability distribution. That is, suppose that the 

steady-state probability distribution could be determined 
using the wcrk of Gordon and Newell [46]. Further, suppose 
that the (marginal) performance of the parallel subsystem of 
queues, bounded by points A and B in Figure 1.8, is of 
particular interest. 

The first step in examining this system is to "short 

out" the subsystem as depicted in Figure 1.9. (This is 

equivalent to replacing the subsystem by a queue with zero 

service time.) Next, for each n = 1, 2, ..., N, the 

"shorted" system of Figure 1.9 is solved to determine the 

rate T (n) at which jobs pass through the short (from A to B) 

when there are n jobs in the system. (For this example the 

work of Kcenigsberg [74], discussed in subsection 5.1, is 

applicable.) Finally, returning to Figure 1.8, the portion 

of the original network exterior to the subsystem of 

interest is replaced by a single composite queue, Q , having 

c 

a state dependent service rate T(n). The analogue of 
Norton's theorem states that the behavior of the subsystem 
in the resulting simplified system (Figure 1.10) is the same 
as its behavior in the original system (Figure 1.8) . 

Note that this technique reduces the analysis cf a 
rather large, complex system to the analysis of N + 1 
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Figure 1 . 10- -Network of Figure 1.8 with Composite Queue Replacing 




smaller, simpler systems. Also, note that the state 

dependent service rates, I (n) , of the composite queue, Q , 

c 

are not affected by changes in the values of parameters 

within the subsystem. Thus, analysis of the sensitivity of 

the behavior of the subsystem to changes in the service 

rates at C / Q and Q and changes in the routing 
4 5 6 

probabilities can be accomplished through multiple solution 

of the central-server system. Figure 1.10, rather than the 
entire complex system. Figure 1.8. 

Also in [20] Chandy, et. al. show that similar 
analogues to Norton's theorem hold exactly for the networks 
of Baskett, et. al . [6], which satisfy the local-balance 
conditions. In an accompanying paper [21], they discuss an 
iterative algorithm, based upon these analogues of Norton's 
theorem, for arriving at approximate solutions to network 
problems which do not satisfy the local-balance conditions. 
It is apparent from the examples in this paper that setting 
of tolerance limits for use in the algorithm to ensure a 
given degree of accuracy in the final solution may be a 
difficult problem. The same tolerance limits yield 
different degrees of accuracy in different problems. A 
refinement of this approximate analysis technique is 
discussed by Sauer and Chandy [98], who concentrate on 
central-server models. 

In the mid-1960's Wallace and Rosenberg [108] 
reported on BQA (for recursive queue analyzer) , a computer 
program designed to numerically solve the balance equations 
resulting from Markovian queuing networks. The relationship 
between the procedures used in RQA and the procedures 
presented in this thesis is discussed in Chapter III. In 
[1C8] Wallace and Rosenberg admit that representation of a 
model in the form of a matrix which can be used by RQA is 
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tedious. Ho^iever, a more recent paper by Irani and Wallace 
[55] describes an effort zo design another program capable 
of translating a graphical description of a network of 
queues into a form which can be used by RQA, thus relieving 
this tedium. The author has not seen a final report on this 
effort, and it is not clear from [55] that such a program 
was in existence at the time the paper was written. Use of 
RQA has been reported by Smith [102] and Wallace and Mason 
[107]. Lavenberg [75] has reported using a computational 
method siuilar to that used in RQA. 

This completes the literature review. Attention is 
new drawn to solution of some of the computational problems 
encountered in using the Markov chain approach on networks 
of queues. This is the subject of Chapter II. 
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II. 



CCMPUTA^ONAL PRO^SaS IN 



MODELING 



This chapter addresses the computational problems which 
arise when a network of jueues is modelled as a Markov 
chain. As noted in Chapter I, seme important work has been 
done in cases in which local balance holds and a product- 
form solution results. The emphasis in the current chapter, 
and throughout the thesis, is on those cases in which local 
balance dees not held. 

In the first section the basic model is introduced. 

This model is a closed central-server model with no 
immediate feedback. (That is, jobs leaving a processor must 
precede to a different processor.) Although many of the 
techniques discussed here are applicable to more general 
models, this basic model serves as a focal point and is not 
devoid of useful application. 

The first problem facing anyone who develops a Markov 
chain mcdel is the definition of a state description which 
contains the information necessary to fully describe each 
state and differentiate between states. This is the subject 
of the second section. The vector representations presented 
there are similar to those found in many references, for 
example, Easkett, Chandy, Muntz and Palacios [6]. However, 
the attempt of the author to present computational 
procedures which are easily adaptable to codes for use on 
high-speed digital computers has resulted in a more explicit 
definition of the representations than is often found in the 
open literature. 

Once a representation for the states has been chosen. 



73 



attention turns to development of the equations of 
statistical equilibrium, i. e., the balance equations. 

(Note that the concern here is the long-term average 
results, i. e., the so-called "steady-state" results.) This 
attention results in several computational problems, namely, 
how to generate these equations, how to store them, and 
finally, how to solve them. The first two of these 
questions are the subject of section 4. The final question 
is discussed in the next chapter. 

It is generally inefficienr (not to mention tedious) to 
refer to the states by their vector representations when 
storing and solving the balance equation. It therefore 
becomes useful to sequence the states so that they may be 
referred to by sequence number. In section 3 the 
lexicographic sequencing procedure (so called because cf its 
similarity tc the alphabetical sequencing of words in a 
dictionary, cr lexicon) is described and applied to the 
vectors representing the states. Also in this section is 
found explicit discussion of two related, and quite 
difficult, problems: storage of the vector representations 

of the states, and determination of the sequence number of a 
state from its vector representation. 

The fina] section of this chapter contains a discussion 
of the properties of the system of balance equations. In 
particular, seme of the properties which are important in 
choosing a solution method are pointed out and discussed. 

Most of the concepts and computarional procedures 
introduced in this chapter are first presented in rhe 
context of a particular case of the basic model. This case 
is introduced in subsection 1.1 and is called the "Key 
example" throughout the thesis. A wide variety of 
generalizations of the hey example are discussed in 
subsections 1.2, 2. 2-2.7, 3. 3-3. 9 and 4.2-4. 7. These 
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suhsecticns lay be omitted on a first reading to relieve 
some of the tedium which otherwise results rather naturally. 



1. THE BASIC MODEL 



The basic model of interest in this chapter is what has 

been called by many authors a central-server model. As 

indicated in Figure 1.1 this model is a closed network of 

queues consisting of a central server, hereafter called the 

CPU (central processing unit) , and M peripheral processors 

(hereafter called the PP's) , labeled PP1, PP2, ... , PPM. 

Any job, or customer, completing service at one of the PP's 

is routed directly to the CPU. Any job completing service 

at the CPU is routed directly to one of the PP's. The 

choice of PP in this latter case depends upon a probability 

distribution (p } and is independent of previous choices. 

i 

The individual probabilities in this distribution are termed 
branching pretab iliries. 

Being a closed network, the number of jobs circulating 
among the M+ 1 processors is fixed. We will use N to denote 
this number. 

As indicated by the terminology used here, the author is 
interested in models which are approximations of computer 
systems. This should not detract from the fact that the 
concepts and techniques discussed in these pages are 
developed for networks of queues and not just models of 
computer systems. Many other systems could te modeled as 
queuing networks, some even as central-server models. 

The closed central-server models discussed in this 
chapter are, at best, approximations of computer systems. 
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Ibis is mcst obvious from the fact that the number of jobs 
is fixed. However, as discussed by Gordon and Newell [46] 
and others, many open queuing networks can be reformulated 
as closed networks. Also, the central-server model can be 
used as a submodel in a model which allows the number and 
composition of jobs in the system to vary. 

The model described above is very general. For example, 
as indicated in subsection 2.3, multiple (identical) CPU's 
can be modeled as a single multiserver CPU. Most of the 
techniques discussed are applicable to more general queuing 
networks, as hinted at in subsection 1.2. Some of the 
reasons for considering closed central-server models are 
given at the end of section 5. 

Throughout this chapter a particular example is used to 
introduce and illustrate the various concepts and 
techniques. The generality of these concepts and techniques 
are then discussed in terms of generalizations of this key 
example . 

1.1. The K ey Example 

Per the key example all service times are 

exponentially distributed and all queuing disciplines are 

FCfS. Twe types of jobs, N of type one and N = N - N of 

1 2 1 

type two, circulate in the system. The service rate 

(reciprocal cf mean service time) at each queue and the 
branching probabilities depend upon job type. RAT3(I,J) 
denotes the service rate of a type-I job (I = 1,2) at 
processor J (J = 1 ,2 , . . . , M+ 1 ) , where processor J is the CPU 
if J = M + 1 and PPJ if J < a+1. ALFA (I, J) denotes the 
probability that a type-I job (I = 1,2) routes to PPJ (J = 
1,2,...,M) after completion of service at the CPU. Each 
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queue has cciy one server. 



The key example as described here is the model 
discussed by Gaver and Humfeld [39J. The development 
presented here is similar to that used in [39], although no 
generalizations are discussed there. 

1.2. Gener ali za tions 

At this point generalization of the number of job 
types is obvicus. If K is the number of job types, let N 

k 

be the number of type-k jobs in the system for k = 1, 2, 

..., K and make the restriction that N +N +...+N =N. 

1 2 K 

The rate matrix RATE (I, J) and the brancning probability 
matrix ALIA (I, J) would be expanded to allow I to become as 
large as K. 

Generalization of the service distributions to 
include generalized Erlangian distributions would involve 
addition of another dimension to the rate matrix to indicate 
stage of service. For example, we might use RATS(I,J,1) to 
denote the rate parameter associated with the L-th stage of 
service for type-I jobs at the J-th processor. In addition 
it would be necessary to provide another matrix to indicate 
the probability of completing service at the processor with 
completion at each stage of service. 

Note that the form of the network itself may also be 
generalized. Ey adding another dimension to the branching 
probability matrix, consideration may be given to the most 
general type of queuing network. For example, 

AIFA (I, J,l, J*) might be used to denote the probability of a 
type-I job routing to processor J after completing the L-th 
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stage of service at processor J'. 



Other generalizations will be discussed in the 
following sections. 

2. STATE DESCRIPTIONS 



Note that the structure of the key example involves 
Markov-like assumptions. In particular, service times are 
exponentially distributed (raemoryless) and choice of PE to 
route to, following service at the CPU, is independent of 
choices that may have teen made earlier. Indeed, it is 
exactly these assumptions which allow the construction of 
the balance equations and the solution for steady state, 
both in the key example and in numerous generalizations. 

Nonetheless, sc long as there is at least one processor 
to which johs cf more than one type can route, at which 
different types cf jobs receive service at different rates, 
and which has a FCES queuing discipline, a simple 
compilation cf numbers of jobs present at each processor 
does not define a Markov process. Even an extension tc 
specify the number of jobs of each type present at each 
processor will not define a Markov process. 

To see that this is true, consider the key example in 
which the CPU is a processor of the type described in the 
preceding paragraph. The probability of leaving a state in 
the time interval (t,t+dt), given that the system is in that 
state at time t (a quantity needed in development of the 
Kolmogorov differential equations and subsequently the 
balance equations) , is equal to a sura of terms one of which 
is proportional xo service rate at the CPU for the 
particular job type in service there. Thus, ix is necessary 
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to know which type of job is in service. Further reflection 
concerning what needs to be known in order to develop the 
Kolmogorov differential equations and the balance equations 
reveals that the order in which the types of jobs arrived at 
the CPU, cr equivalently the order in which the types of 
jots currently at the CPU will receive service there, must 
be known. 



2.1. Key Example 

Fcr the present assume that ALFA (I, J) is positive 

for I = 1,2 and J = 1, 2, ..., t1. That is, assume that any 

jot can visit any given processor in the course of its 

travels. Then as discussed above, to specify a state it is 

necessary to specify the order of job types at each 

processor. Towards this end consider a state vector of 

length N + M . The first N components of the state vector 

consist of seme arrangement of N "ones" and N "twos" 

1 2 

representing the N type-one jobs and N type-two jobs in 

1 2 

the system. For J = 1, 2, ..., M component N + J enumerates 

the jobs at PPJ. Hereafter this vector representation is 

termed ISIATE. Now suppose that for each J = 1, 2, ..., 

ISIATE(N+J) = L . Then the following conventions are 

J 

followed ; 

(i) The first L componenxs of ISTATE represent 

the jobs at PP1, the next L components represent 

2 

the jobs at PE2, ..., the next L components 

M 

represent the jobs at PPI1, and the remaining 

components (of the first N) represent the jobs at 
the CPU. 
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(ii) The components of ISTATE representing jobs at 
a given processor are listed in the reverse of 
their order of arrival (the line forms to the 
left), and hence, in the reverse of their order of 
service at that processor. In particular, the 
rightmost component is the type of job in service 
at that processor. 



Example 2 . 1 



Let N = 3, N =4 and M = 2 . Then N = 7 and ISTATE 

1 2 

must have length N + M = 9. Consider ISTATE = 

(2, 1, 2, 1 ,2 , 2 , 1 ,3, 2} . Note that there are = 3 jobs at PP1, 

1=2 jobs at PP2 and N - L - L =2 jobs at the CPU. 

2 12 

This is emphasized in the following diagrammatic 
representation of this state vector. 



job s a t , PP 1 jobs a t P P2 j obs a t C PU L 



1 



1 



1 



At PP1 a type-two job is in service, a type-one job will 
enter service next, and a type-two job will enter service 
after these ethers have completed service. At PP2 a type-two 
job is in service followed by a type-one job. At the CPU a 
type-one job is in service followed by a type-two. jcb 



Example 2.2 



Let N = 3, N = 4, M = 3 and ISTATE = 

1 2 

{2, 1 , 2 , 1 ,2 ,2, 1 ,0, 0, 0} . This is represented diagrammat ically 



by : 
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at PP1 



at PP2 at_PP3 jobs a t CPU L L I 

2, 1,2, 1,2, 2, 1 0 0 C 

All jobs are at the CPU. A type-one job is in service. The 
next six jobs to enter service at the CPU will be of types 
two, two, one, two, one and two respectively. 



Example 2.3 



that 



Suppose = 3, = 4 and M = 5. Suppose further 

it is observed that there are two type-two jobs at PP2, 



two type-one jobs at ?P4, and all other PP's are idle. Then 
the ISTATE vector representation of the current state of the 
system is 

ISTATE = {2, 2, 1 , 1 ,2,2, 1 ,0, 2,0,2,0} if a type-one 
job is in service at the CPU; 

ISTATE = {2, 2, 1,1, 2, 1,2, 0,2, 0,2,0} if the type-one 
job will enter service at the CPU after the 
current job completes service there; or 
ISTATE = {2, 2, 1,1, 1,2, 2, 0,2, 0,2,0} if the type-one 
job will enter service at the CPU only after both 
other jobs there have completed service. 
Digrammatically, these three srates may be represented as 
follows; 






P P2 



2,2 




PP3 PP4 PP5 



1 

1 

1 



1 

1 

1 



CPU 



2 , 2,1 
2 , 1,2 
1 , 2,2 



0 

0 

0 



2 

2 

2 





L 

"5 

0 

0 

0 



In view of Example 2.3 it is evident that any state 
of the system can be represented in terms of an ISTATE 
vector. If ALFA (I, J) is positive for I = 1,2 and J = 1, 2, 
..., a, then a different: state is represented by each ISTATE 
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vector satisfying the following rules; 

Rule 1: The length of ISTATE is N + M. 

Rule 2: The first N components of ISTATE consist 

of N ones and N twos in some order. 

1 2 

Rule 3: Each of the last a components of ISTATE 

is a non-negative integer. The sum of these M 
integers is no larger than N. 

If ALFA (I, J) = 0 for some I and J, seme of the 

ISTATE vectors satisfying these three rules will not 

correspond tc feasible states. For example, if ALFA (1,1) = 

0, any ISTATE vector which has L = ISTATE (N+1) > 0 and 

1 

which has a cne among the first L^ components corresponds to 

an infeasible state. Such cases may be handled in a number 
of ways, seme of which will be discussed in the coming 
sections . 



2.2. Generalization of Num ber of Job T vp es 

If there are K job types and N jobs of type k fer k 

k 

= 1, 2, ..., K, then all of the above applies with Rule 2 
modified to read: 

Rule 2; The first N components of ISTATE consist 

of N ones, N twos, ..., and N ''K”s in some 
1 2 K 

order. 



2.3. Gener al iz aticn of N u m b^ of S erv ers 



Consider a model similar to the key example to model 
a system with L identical CPUs but with a single waiting 
line, so that if more than L jobs are at the CPUs, rhe first 



82 



jot enqueued will te serviced at the first available CPU. 
Then the CPU could be modeled as a multiserver queue. The 
information necessary to specify a state is the same as that 
required for the key example except that rather than the 
order of jobs in service at the CPU we need know only the 
number of jobs of each type in service. 

Keeping this in mind, the ISTATE vector could he 
reformulated to include the number of jobs at each PP (M 
components), the number of each type of job in service at 
the CPU (K components if there are K job types) and, in 
appropriate order, job types for those jobs at the PP's and 
those jots enqueued at the CPU. Because the number of jobs 
enqueued at the CPU varies from state to state, ISTATE will 
have variable length when formulated in this manner. 
Computationally (in FORTEAS, for example) ISTATE must have 
dimension large enough to accomodate the largest possible 
length. This largest length is easily seen to be N + M + K 
(realized whenever there are no jobs at the CPU) . This 
becomes more complicated as the number of multiserver 
processors increases. 

Note, however, that the same formulation for ISTATE 
as was discussed above for the key example can be retained 
if any information concerning multiservers is retained 
separately and techniques are developed to computationally 
handle any resulting complications. Specifically, these 
complications are due to the nonunique representation of 
some states in terms of the ISTATE vector. The following 
example illustrates this point. 



Example 2.4 

Let K = 2, N =1, N =1, M=2 and the number of 
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i 



servers at the CPU, L = 2. Then the state with both jobs at 

the CPU (and therefore both jobs in service at the CPU) is 
represented by both ISTATE = {1,2, 0,0} and ISTATE = 

{ 2 ,, 1 , 0 , 0 } . 



The single server processors discussed earlier can 
be considered as a special case of the multiserver 
processor. However, a more interesting special case is the 
infinite server (IS) queue as introduced in subsection 4.3 
of Chapter I. In the context of closed networks of queues 
where the total number of jobs is limited, the number of 
servers may not be truely infinite. P.ather it is 
sufficiently large to simult aneously accomodate all jobs 
which may visit the queue. In this sense, the CPU is an IS 
queue in Example 2.4. 



2.4. General ization of Qu euing Di sc ip line 

(The queuing disciplines referred to here are 
discussed in seme detail in the second section of Chapter 
I.) 



For all of the disciplines discussed here, the 
structure of the ISTATE vector as originally introduced 
contains all required information. Although special 
reformulations which contain the required information in a 
more explicit form are possible in most cases, problems 
similar tc those discussed above for multiserver queues 
arise. As a result the author feels that it is advisable ro 
retain the current structure of the ISTATE vector and handle 
computationally any resulting complications. 

LCFS: The information required is exactly the same 

as that required fer FCFS. However, the enqueued jobs are 
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listed in their order of arrival rather than the reverse. 

In this way the rightmost job is still the one receiving 
service and the rightmost enqueued job is still the next to 
receive service when a server is available. 

Ncnpreemptive priority: If, for example, type-one 

jobs have higher priority than type-two jobs at certain 
queues then the jobs enqueued at those queues must be listed 
so that all ones are to the right of all twos. 

Preemptive priority: Again, if type-one jobs have 

higher priority than type-two jobs, then all jobs at such 
processors must be listed so that all ones are to the right 
of all twcs. 

ICPSPfi and LCFSPEpt: All jobs at such processors 

must be listed in the order of their arrival with rightmost 
job being the most recently arrived job. If the processor 
has a nonexponential service distribution, as discussed 
below, the stage of service stored for enqueued jobs is the 
stage the job was in when interrupted in the LCFSPR case and 
the highest stage attained by the job since last leaving the 
processor in the LCFSPRpt case. Additional components for 
each server at each LCFSPRpt processor are required to 
record the current stage of service for each job in service. 

P£: The order of the jobs at such processors is 
immaterial. This will be discussed more fully in the coming 
sections, but the meaning of this statement is clarified in 
the following example. 



Example 2.5 

Let K = 2, N = 2, M = 2, M =2 and suppose rhat the 
1 2 
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CEO has a PS discipline. Then the following six ISTATE 



vectors represent the sane state; 



{ 1 , 1 , 2 , 2 , 0 , 0 } 
( 1 , 2,1 , 2 , 0 , 0 } 
( 1 , 2 , 2 , 1 , 0 , 0 } 



( 2 , 1,1 , 2 , 0 , 0 } 
( 2 , 1 , 2 , 1 , 0 , 0 } 
( 2 , 2,1 , 1 , 0 , 0 } 



2.5. Generalizat ion of Servic e Distributions 

Consider modification of the key example to 
incorporate generalized Erlangian service distributions for 
one or more types of jobs at one or more of the processors. 
Then at least for those types of jobs having nonexponential 
distributions and at those processors where these 
distributions apply, it will be necessary tc include in any 
state description the stage of service. (See Example 1. 1 in 
Chapter I.) If all processors have non-preemptive queuing 
disciplines, this may be accomplished by adding 11 + 1 
components to the ISTATE vector (assuming each processor has 
a single server and none have a PS queuing discipline) and 
using these tc store the stage of service for the job in 
service at the M + 1 processors. (Of course, fewer than M + 
1 components need to be added if one or more processors have 
only exponentially distributed service times. Furthermore, 
more than H + 1 components may be required if one or more 
processor has multiple servers or a PS queuing discipline.) 
If one or more processors has a preemptive service 
distribution, N components should be added to ISTATE tc 
record the stage of service for each of the N jobs in the 
system. Ihis number should nor he reduced for reasons 
similar tc those discussed under mulriple server queues 
above . 



86 



2.6. Fin it e Cap acities and Block ing 

Consider the key example with the following change; 
The CPU has a finite capacity, C. That is, if there are C 
johs at the CPU (in service and enqueued) and a job 
completes service at one of the PP's, rather than routing to 
the CPU the job stays at the PP blocking other jobs enqueued 
there from receiving service. When a job subsequently 
completes service at the CPU and routes to one of the PP's, 
a job will simultaneously route from one of the blocked PP's 
to the CPU, allowing the next job (if any) enqueued at that 
PP to begin service. 

Such models can be handled using the techniques 
discussed in this taesis if soma procedure is specified for 
determining which PP will be unblocked by a service 
completion at the CPU if more than one PP is blocked. 

Options for such procedures include: 

(i) Priority scheme: e.g., PPl is unblocked 

before PP2, which is in turn unblocked before ??3, 

• • • 

(ii) Probabilistic scheme: e.g., having assigned 

a positive number Q to PPi for i = 1, 2, ..., M, 

i 

the probability that PPj is the one unblocked 
(given that it is blocked) is equal to Q divided 

j 

by the sum of the Q 's over ail blocked PP's. The 

i 

Q 's are relarive priority numbers in that PPi is 
i 

twice as likely to be unblocked as PPj if botn are 
blocked and Q is twice as large as Q . 

i j 

(iii) A combination of (i) and (ii) : e.g., first 

priority is given to the PP to which the job 
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completing service at the CPU routes; if this PP 
is not blocked, each of the blocked PP’s are 
equally likely to be the first unblocked. 

(iv) FIPO scheme: i.e., the PP which has been 

blocked the longest (or shortest for a LIFO 
scheme) is the one unblocked. 

iihatever the scheme, the model can probably be 
handled by addition to ISIATS of an additional component for 
each PP specifying its blocked/not blocked condition. In 
the priority and probabilistic schemes (and combinations of 
them) this component can be set equal to one when the PP is 
blocked, and equal to zero otherwise. In a FIFO (or LIFO) 
scheme the value taken on by this component should indicate 
its relative position among the blocked PP's. For example, 
it may be set equal to zero if the PP is not blocked, one if 
it has been blocked longer than any other blocked PP, two if 
only one PP has been blocked longer, etc. 

If one or more of the PP's is a mulriserver queue 
and a priority or probabilistic scheme is used, the 
blccked/nct blocked component can be used to indicate rhe 
number of servers blocked. In such cases, the order of the 
job types in ISTATE for a multiserver PP should be those 
enqueued followed by those being served followed by those 
blocked. 

If cne or more of the PP's (rather than rhe CPU) has 
finite capacity, a component should be added to ISTATE for 
each server at the CPU. (The CPU is assumed to not have a 
PS discipline.) In this case the component corresponding to 
a given server should tell which PP is causing blockage of 
that server. 

If cne or more PP's and tne CPU have finite 
capacity, care should be taken that it is not possible to 
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deadlock the system by having the CPU blocked waiting for a 
vacancy at seme PP and having that same PP blocked waiting 
for a vacancy at the CPU. If such a state is feasible, it 
is an absorbing state and the resulting steady-state 
conclusions are predictable. (For more on deadlocking, the 
reader is referred to Havender [49].) 



2.7. Bu lk i ng 

Another phenomenon which can be handled using the 
techniques discussed in this thesis is bulking. Bulking 
occurs when jobs (possibly of a given type) are required to 
leave a given processor in groups of two or more. For 
example, if type-two jobs must route from the CPU in groups 
of three, a type-two job completing service there will enter 
a bulking queue if no more than one other type-two job is in 
the bulking queue or will route with two other type-twc jobs 
(both from the bulking queue) to one of the PP's. To handle 
bulking it is^ necessary to add one component to ISTATE for 
each processor at which bulking occurs. This component will 
hold the number of jobs in the bulking queue for that 
processor. The order of job types in ISTATE for a processor 
at which bulking can occur should be (from the left) those 
enqueued for service, those being served and those in the 
bulking queue. 

3. STORAGE AND SEQUENCING OF STATES 



In the next section it will be shown that the ISTATE 
vector form cf state representation lends itself readily to 
generation of the balance equations. However, informative 
as this representation is, and as important as it may be in 
generation of the balance equations, there are some 
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important reasons for sequentially numbering the states; 
i.e., assigning to each state a name, its state number, from 
among the positive inxegers so that it may be referred to 
directly without reference to a lengthy string of numbers. 



First, when properly sequenced, the states may be 
considered one at a time without fear of inadvertently 
emitting or repeating states. This is important since 
several passes through the entire list of states is required 
to generate the balance equations, solve them for the 
steady-state probability distribution, and from this 
distribution calculate the desired measures of system 
performance . 



Second, storage of the balance equations is considerably 
more efficient if done in terms of the state numbers rather 
than the vectors. Each balance equation involves reference 
to at least two states and, in many cases, more. Consider a 
balance equation referencing five states in a model in which 
there are seven jobs (N = 7)i^and three PP*s (M = 3) . The 
state numbers for this balance equation would require five 
storage spaces; the state vectors would require fifty. If, 
for example, all processors were modeled as FCFS single 
server queues (the key example) and the seven jobs were six 
of one type and one of another, then five is about the 
average number of references for each of the 840 balance 
equations. So the advantage in this case is storage of 
about 42CC numbers as compared to 42,000. 



Third, to solve the balance equations it is not 
necessary to know anything about the individual states. All 
that is required is a way of differentiating between the 
states. Sequential numbering of the stares provides not 
only this capability, but also a convenient way to proceed 
through the balance equations while computing a solution. 
(That is, in a F0HT31N program we will be allowed to use a 
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"DO loop.") 



As is evident froia the above paragraphs, the balance 
equations shculd be stored in terms of the state numbers. 
However, the information contained in the vector 
representation is required to generate the balance 
equations. Hence, a procedure is required for efficient 
deter minaticn of the state number from its vector 
representation. The problems of sequentially numbering the 
states, storing their vector representations and determining 
the state number from the vector representation are 
interconnected and will be considered simultaneously. 

3.1. Le xic o graphi c Sequ en cing of Inte ger Vect ors 

The techinque which leads to useful sequencing of 
the ISIAIZ vectors is constrained lexicographic ordering. 
This technique has long been used in mathematical analysis 
as a means of proving that the rational numbers in the 
interval (0,1) are countable. It has also been useful in 
enumeration algorithms for integer programs. 

The basic procedure for constrained lexicographic 
sequencing cf integer vectors is summarized in the following 
steps: 

(i) To get the first vector, start with the 
leftmost component and successively set each 
component equal to the smallest number which will 
yield a vector satisfying the constraints. 

(ii) Each successive vector is derived from its 
predecessor by increasing the value of one of the 
compcnents and successively setting each component 
to its right equal to the smallest number which 
will yield a vector satisfying the constraints. 

In each case the component increased is the 
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rightmost component which can be increased without 
violating the constraints. The number to which it 
is increased is the smallest number (larger than 
its previous value) which will satisfy the 
constraints. 

A more detailed version of this procedure is presented in 
Algorithm 1 in Appendix B. 

The constraints referred to here vary from problem 
to problem. In gereral, they are equivalent to the 
characteristics of the vectors to be sequenced. In the case 
of integer programs, the constraints of the program are the 
ones referred to above. The following example shows hew 
constrained lexicographic sequencing of integer vectors is 
used to prove that the rational numbers in the interval 
(0,1) are countable. In doing so it not only exemplifies 
the use cf the procedure outlined above, but also clearly 
sets forth the constraints and shows their relationship to 
the character of the vectors sequenced. 

Example 2.6 

In order to prove that there are a countable number of 
rationals between zero and one, it is necessary to exhibit a 
mapping from the counting numbers onto the set of raticnal 
numbers in (0,1). (See Halmos, [48].) In ether words, 
these rational numbers must be sequenced so that given any 
particular rational number, r, some counting number, i, may 
be determined such that r is the i-th number in the 
sequence . 

Toward this end recall that a rational number is the 
ratio a/t of two integers. Since the goal is to sequence 
the raticnals in (0,1) , a and b may be assumed positive with 
a < b. Now consider the set of all vectors {b,a} with a and 
b positive integers and a < b. Since such a vector can be 
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such 



found for each rational number in (0,1), and since each 
vector can correspond to only one rational number in (0, 1) , 
the proof is complete if the vectors can be sequenced. In 
mathematical terms, the mapping which takes {b,a} to a/b is 
a mapping frcm the set of all positive integer pairs {t,a} 
such that a < b onto the set of rational numbers in (0,1). 
This mapping composed with the mapping of the counting 
numbers onto the set of such integer pairs (represented by 
the sequencing of these pairs) yields a mapping of the 
counting numbers onto the set of rational numbers in (0,1), 
Exactly how this works will be clarified after the above 
procedure has been used to sequence the vectors. 

First the constraints must be derived. If £b, a} is a 
"valid" vector (that is, one of the vectors to be 
sequenced) , then a and b are positive integers and a < b. 
Thus, the constraints to be used in the sequenceng procedure 
are: 

Rule 1: a and b are positive integers. 

Rule 2: a < b. 

Now that the constraints have been specified, the 
procedure outlined above can be used to sequence the 
vectors. The first vector is determined using step (i) . 
According to this step, the components are determined from 
left to right. That is, first b and then a will be 
determined. Furthermore, each component will be set equal 
to its smallest feasible value. 3y Rule 1 this value is one 
for both a and b. However, Rule 2 restricts b to be larger 
than a. Thus, the smallest value of b which will lead to a 
valid vector is two. The choice for a is necessarily one 
and the first vector is (2,1). 

Having found the first vector, the second, third, 
fourth, and so forth vectors are determined by successive 
application cf step (ii) . In determining the second vector, 
note that increasing a will violate Rule 2. Thus, let b = 3 
and set a equal to the smallest value which yields a vector 
satisfying the constraints. The second vector is seen to be 
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{3,1}. The value of a can be increased to derive the third 
.vector; {3,2}. Step (ii) is used successively to generate 
the remaining vectors: C^/1}r {^/2}, {4,3}, {5,1}, {£,2}, 

• • • 

To determine the sequence number of a particular 
rational number, r, in (0,1), first find positive integers a 
and b such that r = a/b. Then, r is the i-th rational (more 
formally, i gets mapped onto r) if {b,a} is the i-th vector. 
For example, if r = .375, r = 3/8 and r is seen to be the 
24-th rational number according to this sequencing 
procedure. 

Note that in this example each rational number in 
(0,^1) has more than one (in fact, an infinite number) of 
sequence numbers. For example, r = .375 is not only the 
24-th rational, but also the 111-th (r = 6/16) and the 
1977-th (r = 24/64) . 

Cn the other hand, each "valid" vector has a unique 
sequence number. This fact is important in the applications 
of interest in this thesis. In these applications each 
state of a system is denoted by a unique integer vector. 
Using the lexicographic sequencing procedure outlined above, 
these vectors are sequenced. The sequence numbers are 
called state numbers and are used in storage of the balance 
equations. It is important that each state has only one 
state number, so that inadvertent repetition of states may 
be avoided. 

Cf course, it is also important to avoid inadvertent 
omission of states. This is done by carefully choosing the 
vector representation and the constraints sc that each state 
is uniquely defined by a vector satisfying the constraints 
and each vector satisfying the constraints describes a 
unique state. In section 2 the ISTATE vector representation 
was introduced for the key example and a variety of 
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generalizations. Referring to subsection 2.1, the 
constraints to be u-sed in the lexicographic sequencing of 
the ISTATE vectors for the key example are: 

ISTATE Sequencing Constraints 
Rule 1: The first N components contain N ones 

and N twos. 

2 

Rule 2: The remaining M components are 

nonnegative integers whose sum is no larger than 

N. 

The following example illustrates the application of the 
lexicographic sequencing procedure to the ISTATE vectors for 
a particular case of the key example. 



Example 2.7 

Consider the key example with N = 2, N =2 and M = 

1 2 

2. The following table shows 27 of the 90 states with their 

corresponding state numbers as provided by the lexicographic 
ordering procedure above applied to the ISTATE vectors. The 
entries in the column labeled NS are the state numbers. 



c 


ISTATE 


NS 


ISTATE 


NS 


ISTATE 


1 j 


[1,1 ,2, 2, 0,0] 


10 i 


[1, 1,2, 2, 2,0] 


19 1 


[1,2, 1,2,0,33 
ri, 2, 1,2,3,11 


2 


1, 1 , 2, 2,0, r 


11] 


[1, 1,2, 2, 2,1] 


29 1 


3 i 


1,1, 2, 2, 0,2 


12 


1,1,2,2,2,2' 


30 1 


1,2, 1,2, 4, O' 
1,2, 2,1 ,0,0 


4 ] 


1, 1,2, 2,0,3; 


13 ) 


1, 1,2, 2,3, o: 


3 1] 


5 


1,1 ,2, 2, 0,4 


14 


1, 1,2, 2, 3, 1 


45 i 


1,2,2,1,4,01 


6 


1, 1,2,2, 1,0 


15 i 


1, 1,2, 2, 4,0 


46 1 


2,1,1,2,0,01 


7 ] 


1,1, 2, 2, 1,1 


16 


■1,2, 1,2, 0,0 


6 1] 


2, 1,2, 1,0,0 


8 i 


1,1,2,2,1,2' 


17 1 


1,2,1,2,0,i; 


76 ] 


2,2,1,1,0,01 


9 1 


1,1, 2, 2, 1,3 


18 


1,2, 1,2, 0,2 


90 ] 


2,2, 1,1, 4,0 



Note that for this example Rule 1 states that the first four 
components must consist of two 1 's and two 2's. Rule 2 
states that the sum of the last two components must be no 
larger than four. In setting up the first state vector, the 
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first twc comfcnents are set equal to one since by Rule 1 
this is the smallest value these components may take on. 

Rule 1 new forces the third and fourth components to be set 
equal to two. The last two components are both set equal to 
zero, the smallest values they can take on and still satisfy 
Rule 2. In progressinq from state to state; the sixth 
component is successively incremented until (state 5) it 
cannot be incremented further without causing violation of 
Rule 2. At this point the fifth component is incremented, 
and the sixth component is set at its smallest feasible 
value. Successive states are determined by again 
incrementing the sixth component. This procedure continues 
until state 15 wher« further incrementing either of the last 
two components will cause violation of Rule 2. Incrementing 
either the third or the fourth component will cause 
violation of Rule 1. So, to get state 16, the second 
component is incremented, the third and fourth components 
are set at the smallest values allowed by Rule 1 (one and 
two respectively) , and the last two components are set at 
the smallest values allowed by Rule 2. From here the 
procedure starts again to increment the last component in 
determining successive states. 

For the key example lexicographic sequencing of the 
ISTATE vectors is convenient, and an efficient computational 
scheme can be developed for determining the state number 
from the vector representation. For the generalizations 
discussed earlier in this chapter, this same sequencing 
procedure could be used if further constraints are imposed. 
However, other sequencing procedures are found to be more 
convenient or more appealing for some of these 
generalizations and, at rhe same time, no less convenient 
for the key example. Furthermore, they allow computational 
analysis of one of the solution procedures (see Chapter 
III) . If the states are arranged according to a 
lexicographic sequencing of the ISTATE vectors, similar 
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analysis is considerably more complicated and remains an 
open problem (see the conjecture in subsection 4.3 of 
Chapter III.) 

Ihese other sequencing procedures are based on 
lexicographic ordering of vector representations which are 
derived from the ISTATE representation. In this subsection 
we will discuss one of these other procedures. A second 
will be discussed after the problem of storage of the states 
has been considered. 

Consider splitting the ISTATE vector representation 
of a state intc two parts. (This will be found to be a 
useful idea when we discuss storage of the states in the 
next subsection.) The first N components will be referred 
tc as the left subvector and the last M as the right 

sutvector. Ihe ISIATEmI (ml stands for modification one) 

vector representation of the state is a (row) vector of 

length N + il + 1 whose first component is the sum of the 

components of the right subvector (i.e., the total number of 
jots at the PP's) , whose next M components form the right 
sutvector, and whose last N components form the left 
subvector. The constraints used in the lexicographic 
ordering procedure for these vectors are: 

ISTATEmI Sequencing Constraints 
Rule 1: The first component is a nonnegative 

integer nc larger than N. 

Rule 2: The next M components are nonnegative 

integers whose sum is the first component. 

Rule 3: The last N components consist of N ones 

1 

and N twos. 

2 
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Example 2.8 



Again consider the key example with = 2, = 2 and 

a = 2. The following table shows 20 of the 90 states with 
their state numbers as provided by a lexicographic ordering 
of their ISTATEmI vector representations. Both the ISTATEmI 
and IST'ATE vector representations are given for each lisxed 
state . 



NS ISTATEmI ISTATE 



1 i 


[0 


, 0,0 


1 


, 1,2 


, 2 ] 




[ 1,1 


,2 


,2 


, 0,01 


2 j 


0 


, 0,0 


1 


, 2 , 1 


,21 




1,2 


, 1 


, 2 


, 0 , 0 ] 


3 


0 


, 0,0 


1 


, 2 , 2 


, 1 ] 




1,2 


,2 


, 1 


, 0 , 0 ' 


4 i 


'0 


, 0,0 


2 


, 1,1 


,2 




■ 2,1 


, 1 


,2 


, 0 , 0 - 


5 


‘0 


, 0,0 


2 


, 1,2 


, 1 ‘ 




2,1 


,2 


, 1 


, 0 , 0 ‘ 


6 1 


’0 


, 0,0 


2 


, 2 , 1 


, V 




‘ 2,2 


, 1 


, 1 


, 0 , 0 ' 


7 


'1 


, 0 , 1 


1 


, 1,2 


,21 


• 


1,1 


,2 


,2 


, o,r 


8 


^1 


, 0 , 1 


1 


, 2 , 1 


,2 




1,2 


, 1 


,2 


, o,r 


12 


‘1 


, 0,1 


2 


, 2 , 1 


, V 




2,2 


,1 


, 1 


, 0,1 


13 


■1 


, 1,0 


1 


, 1,2 


, 2 ' 




■ 1,1 


,2 


, 2 


, 1 , 0 ‘ 


18 


1 


, 1,0 


2 


, 2 , 1 


, 1 ' 




’ 2,2 


,1 


, 1 


, 1 , 0 ; 


19 j 


'2 


, 0,2 


1 


, 1,2 


,2 




■ 1,1 


,2 


,2 


, 0,2 


25 


■2 


/!/ 1 


1 


, 1,2 


/ 2 : 




1,1 


,2 


,2 


, 1 , 1 ' 


31 




r2,0 


1 


, 1,2 


, 2 ^ 




■ 1,1 


,2 


,2 


, 2,0 


37 


< 


, 0,3 


1 


, 1,2 


,21 


' 


1,1 


,2 


,2 


, 0,3 


60 


■3 


, 3,0 


2 


, 2,1 


,r 


' 


2,2 


, 1 


, 1 


, 3 , 0 ‘ 


61 i 


'4 


, 0^4 


1 


, 1,2 


, 2 : 


' 


■ 1,1 


,2 


,2 


, 0 , 4 ' 


67 i 


'4 


, 1/3 


1 


, 1,2 


/2 




1,1 


,2 


, 2 


, 1,3 


85 1 




, 4,0 


1 


, 1,2 


,2 




1,1 


,2 


,2 


, 4 , 0 } 


90 i 




, 4,0 


2 


, 2 , 1 


, 1 




2,2 


,1 


, 1 


, 4 , 0 ] 



The following is a diagrammatic representation of the 
ISTATEmI vector representations of three of the states. For 
reference purposes the state numbers have also been given. 

at PP 1 at_PP2 at C PO 

2 11 1 12 2 
2 2 0 1 1 2 2 
3 0 3 1 1 2 2 

3.2. St orage of the S tate s 

Once a sequencing procedure has been established, it 
can be used whenever needed to sequentially generate the 



NS 



25 

31 

37 
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states. If, on the other hand, the states could be 
efficiently stored, they could be recalled from storage when 
needed rather than going through the sequencing procedure 
each time. In addition, a well-designed storage procedure 
can aid immeasurably in determining state numbers from 
vector representations. 

Perhaps the simplest storage procedure is to store 

the vectors in an array, say NAME, so that NAME(i,j) 

contains the value or the j-th component of the ISTATE 

vector representation (or some other vector representation) 

of state i. This is easily seen to be inefficient. For 

example, in modeling the key example with N = 6, N =1 and 

1 2 

a = 3, NAME would have to be dimensioned 840 by 10, 
requiring 8400 storage locations. 

Next, consider converting each vector into a single 
number and storing the number. For example, in the example 
just cited the state i whose ISTATE vector representation is 
{1,1,2,1,1,1,1,3,1,03, could be stored in NAME(i) as 
43101121111 (from the ISTATEml vector repr esentarion) cr 
1121111310 (from rhe ISTATE vector representation) . This 
procedure appears to require only one storage location per 
state. However, there is a limit to the number of digits an 
integer stored in a computer can have. Storing in doable 
precision will double the space required to store the states 
and may net be sufficient for seme of the generalizations. 

On the other hand, if the states are stored according to the 
vector representation used to sequence them, the numbers 
stored would be in ascending order, and an efficient search 
technique could be designed to determine the state number 
for a state given its vector representation. 

By storing separately the right and left subvecters, 
the space required to store the states can be reduced and at 



99 



the same time a more efficient method for determining the 
state number from the vector representation can be 
developed. Consider how this might be accomplished for the 
key example sequenced according to a lexicographic ordering 
of the ISTATEml vector representation. 

Each subvector will be stored as a single number as 

described above. In this way the 840 states of the example 

considered above would require 127 locations for storage of 

the 120 right subvectors and 7 left subvectors. The same 

problems arise concerning the number of digits a stored 

integer can have. However, since the left and right 

subvectors are stored separately, the variety of cases which 

can be handled without utilizing double precision is vastly 

increased. In fact, in many cases storage of the balance 

equations and auxilliary storage required to solve them will 

cause core problems before there is any need to worry about 

utilization of double precision for state storage. (The key 

example with N = 5, N =3 and K = 4 results in 147,340 
1 2 

terms in 27,720 balance equations. As indicated in 

subsection 2.3 of Chapter IV, a case with 35,280 terms in 
6,930 balance equations required 396K bytes of storage on an 
IBM 360-67. Of this, nearly 127K bytes were required for 
storage of the balance equations, state probabilities and 
state vectors. The earlier example would require over 527K 
bytes to store these same things.) Furthermore, the idea of 
splitting a vector into subvectors and storing only the 
subvectors could be used on a subvector too large to stcre 
in single precision. 

New, consider storage of the left subvectors. 

Storage will be in terms of a single number for each 
subvector. For example, if N =2 and N =3, {2, 1,2, 2,1} 
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is a valid left subvector and will be stored as 21221. A 



computationally efficient method of performing such a 
translation from integer vector to integer scalar is given 
as Algorithm 2 in Appendix 3. An easy dual to this 
algorithm can be used to produce an integer vector from a 
stored scalar. Storage of the left subvectors is 
accomplished by using Rule 3 of the ISTATEmI procedure to 
successively generate the subvectors. Each time a new 
subvector is generated. Algorithm 2 can by used to determine 
the corresponding number to be stored. A computational 
method can be developed to derive each stored number from 
its predecessor by addition of some powers of 10 and 
subtraction of others based upon which components of the 
left subvector must be increased and which decreased to 
derive the next left subvector. 

Storage of the right subvectors may be accomplished 
in exactly the same manner. However, if the states are 
ordered according to the ISTATEmI scheme, a relatively 
simple computational schema makes it possible to store the 
appropriate numbers without halving to use Rules 1 and 2 to 
successively generate the right subvectors. 

The numbers to be stored are all integers of M 
digits or less whose digits sum to N or less. These numbers 
are to be stored in groups with those whose digits sum to 
zero first, followed by those whose digits sum to one, rhen 
two, and so forth up to those whose digits sum to N. Within 
each group, the numbers are to be stored in ascending order. 
The number of numbers in the k-th group (i.e., the number of 
integers of M digirs or less whose digits sum to k) is 

C , the number of combinations of M - 1 + k things 

M-1+k,k 

taken k at a time, which may be calculated from: 
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C = n! / [ rl (n-r) ! ] 
n,r 



The 0-th group contains only zero and is not stored. 

0 12 a-1 

The 1-th group consists of 10 = 1, 10 , 10 , .../ 10 

For k > 1, each number in the k-th group is the sum of a 
number 'in the 1-th group and a number in the (k-l)-th group. 
Algorithm 3 in Appendix B uses these facts to store the 
right subvector in an array (KSTG) without generating and 
translating them. This algorithm also requires two 
auxiliary vectors, MSTG1 and NSTG2, each of length N. For 
k = 1, 2, ..., N, NSTGl(k) is the component number of the 
first and NSIG2(k) of the last number in KSTG in the k-th 
group . 



Example 2.9 



Consider the key example with N = 2, N =2 and M = 2 

12 

discussed in Examples 2.7 and 2.8. Recall that this system 
has 90 states so that storing each state as a single integer 
would require 90 storage locations. By using Rule 3 of the 
lexicographic ordering procedure for the ISTATEmI vector 
representation, the left subvectors can be stored in a 
vector LSTG; 

L3TG = {1122,1212,1221,2112,2121,2211} 

Using Algorithm 3 the right subvectors can also be stored: 
KSTG = {1,10,2,11,20,3,12,21,30,4,13,22,31,40} 

The total storage required (including four spaces each for 
the auxilliary vectors NSTGI and NSTG2) is 28 spaces. Note 
that if no use is to be made of the contents of NSTGI and 
NSIG2 later, KSTG could be generated first and NSTGI and 
NSTG2 could use the same storage locations as LSTG. 

iJiO questions remain. First, how can the states be 
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sequentially generated once KSTG and LSTG have been filled 
in? Second, how can the state number be determined from 
either the ISIATE or the ISTATEmI vector representation of a 
state? 



First, observe that, based upon the three 
ISTATEmI lexicographic ordering, the left subvecto 
successively takes on of its possible va lue s 

sucessive changes in the right subvector. The ans 
first question is now quite natural: 



Algorithm for 

Sequential Generation of States 

(i) Set the right subvector at its first feas 
value, namely, all zeros. 

(ii) To get the first k states, where k is th 
length of LSTG, successively set the left 
subvectcr to the values indicated by the 
components of LSTG, leaving the right subvect 
fixed. 

(iii) Repeat (ii) with the right subvector se 
successively at the values indicated in KSTG. 



This procedure is 
of components of K 
component of NSTG2 
subvector. The nu 
of distinguishable 



demonstrated in Example 2.8. Th 

STG is the value of the last (N- 

if Algorithm 3 is used to store 

mber of components of LSTG is th 

permutations of N ones and N 

1 2 



i.e . , 



C 

N,N ^ 



NI / (N 




To determine the state number of a state f 
its ISTAIE or its ISTATEmI vector representation, 
formula 



rules for 
r 

between 
wer to the 



ible 

e 



o r 



t 



e number 
th) 

the right 
e number 
twos , 



rora either 
use the 
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(2.1) state number = k x C + m 

N, N 

1 

where the right subvector of the vector representation of 
the state corresponds to the k-th component of KSTG (the 
zero vector corresponds to the "0-th component" of KSTG) , 
and the left subvector corresponds to the m-th component of 
LSTG. To determine the values of k and m, convert the left 
and right subvectors of the state into integers and do a 
search through LSTG and KSTG. While converting the right 
subvector, the sum of the components should be accumulated. 
If this sum is zero, k = 0. If the sum is s # 0, then KSTG 
need be searched only from component NSTGI(s) to component 
NSIG2 (s) . A search in LSTG is simplified by the fact that 
the values of the components increase with component number. 

Algorithm 4 of Appendix B gives an alternative to a 
search in LSTG. No comparison has been made between this 
algorithmic procedure and a search procedure. It seems 
likely that the search pro-cedure is better when the length 
of LSTG is relatively small, and the algorithmic procedure 
is better when it is relatively large. It is not known 
whether the algorithm can be extended to cases in which the 
number of job types exceeds two. 

3.3. Ge ner aliza tion of N umber of Job Types 

The development in the preceding subsection still 
applies with the obvious change of Rule 3 for lexicographic 
sequencing of the states according to the ISTATSmI vector 
representation : 

Rule 3: The last N components consist of ones, 

N twos, ..., N K's where K is the total number 
2 K 
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of job types. 

The comb iDatorial term in (2.1) must be changed to 

N!/(N IN I...N !) 

1 2 K 

Example 2.10 

Suppose K=3, N =2,N =1,N =1 and M = 2. Then 

1 2 3 

KSTG is exactly as reported in Example 2.9 and 

LSTG = {1 123, 1132,1213, 1231,1312, 1321,21 13,2131,231 1, 
3112, 3121,321 1} 

The combinatorial term to be used in (2.1) is C = 

41/ (2! 1111) = 12 which can be verified by counting the 
components of LSTG. Using (2.1) we will now calculate the 
state number for the state having ISTATE vector 
representation { 1 , 3, 1 , 2, 0, 3} . Note that its ISTATEmI vector 
representation is (3 ,0 ,3 , 1 ,3 , 1 ,2} . Its left subvector 
corresponds to 1312, the fifth component of LSTG. Its right 
suhvector corresponds to 3, the sixth component of KSTG. 

The state number is; 6x12+5= 77. The total number of 
states is 180. 

3.4. Ge neraliz ation of N um ber of Serv ers 

first consider the case in which the CPU is the only 
multiserver gueue with L > 2 servers. Then, as pointed out 
in subsection 2.3, the order of the jobs in service at the 
CPU is immaterial. The result is that some states will be 
represented more than once in a lexicographic ordering of 
the states (whether based on the ISTATE or the ISTATEmI 
vector representation) . One way to handle this problem is 
to redefine the left subvector to be "those of the first N 
components representing jobs not in service at the CPU." 
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Note that this leads to a variable-length left subvector. 
When the number of jobs at the CPU is zero, the length of 
the left subvector is N. When the number of jobs at the CPU 
is N, the length of the left subvector is N - L if L < N or 
zero if the CPU is an IS queue. Since the left subvector is 
at the right side cf the ISTATEmI vector representation, an 
advantage is realized by sequencing the states according to 
this representation. Namely, by revising the sequencing 
procedure to ignore a number of components at the right end 
of the vector (where the number to be ignored can be 
calculated from the first component of the vector) , no 
states will be duplicated. This can be accomplished by 
changing Buie 3 (for the lexicographic ordering procedure 
for the ISTATEmI vectors) to read: 

Rule 3: The last m = min(L,N-k) components (where 

k is the value of the first component) are all 
zeros. The N - ra components immediately preceding 
these consist solely of ones and twos. There must 

be no more than N ones and no more than N twos. 

1 2 



When storing the left subvectors (as redef 
above) , those cf zero length are not stored. Thos 
positive length are stored in ascending order. Th 
shortest subvectors are listed first, followed by 
shortest, and so forth up to those of length N. S 
auxiliary vectors may be maintained to aid in dete 
state numbers from vector representations and sequ 
generation of the states. For example, NSTG2 may 
hold information concerning the location in LSTG o 
beginning and end of each group cf numbers. For k 
..., N, component NSTG2(k) + 1 would contain the f 
NSIG2(k+1) the last of the k-digit numbers in LSTG 
Rule 3, the subvectors can be sequentially generat 
allowing m to successively take on values from min 
down to zero and, for each value of m, generating 



ined 
e of 
us, the 
the next 
everal 
r mining 
ential 
be used to 
f the 
= 1 / 2 , 
irst and 
. Using 
ed b j 
(L,N-1) 
in 
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lexicographic order all feasible left subvectors of length 
N - m. Each time a new subvector is generated it is stored 
in LSTG, and each time the value of m changes the 
appropriate component of NSTG2 would be given a value. 
However/ as pointed out in Algorithm 5 of Appendix B, cnce 
the shortest subvectors have all been stored in this manner, 
a computational scheme can be used to fill in the remainder 
of LSTG and NSTG2. This algorithm utilizes the fact that 
each (k+1) -digit number in LSTG is the same as some k-digit 
number in LSTG with eirher a one or a two attached to its 
right end. 

After an example consideration will be given to the 
two recurring guestions: How may the states be sequentially 

generated frcm the stored subvector? And how can the state 
number be derived from the vector representation? 



Example 2.11 (a) 



Consider a model similar to the key example except 
that the CPU is a multiserver with L = 2. Suppose H =2, 



= 3 and M = 2. The minimum length of a left subvector 

stored in LSTG is max(1,S-L) = 3. The steps of Algorighm 5 
yield : 

(i) NSIG2(1) = 0, NSTG2 (2) = 0, NSTG2 (3) = 0 

(ii) Starting with the first component, LSTG = 
{112,121,122,211,212,221,222}. Note that 111 is 
not stored since N1 = 2. 



(iii) NSTG2 (4) = 7 
(iv-vii) LSTG (8) = 1122, 
LSTG(9) = 1212, 
LSTG(IO) = 1221 
derived from LSTG (3) 

Starting at the 



derived from L3TG(1) 
derived from LSTG (2) 
and LSTG (1 1) = 1222, 

twelfth component. 



bot t 
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LSTG = {2112,2121,2122,2211,2212,2221} 

(iii) NSTG2(5) = 17 

(iv-vii) Starting 4 t the eighteenth couponent, 

LSTG = {11222,12122,12212,12221,21122,21212, 

21221 , 22112 , 22121 , 22211 }. 

(iii) NSIG2(6) = 27 
An applicaticn of Algorithm 3 yields: 

KSTG = {1,10,2,11,20,3,12,21,30,4,13,22,31,40,5,14, 
23,32,41,50} 

and 

NSTG1 = {1,3,6,10,15,21} 

where the last component or NSTGl has been put in so that 
for k = 1,2>2,4,5, the numbers in KSTG whose digits sum to k 
start at component NSTGl (k) and end at NSTGl (k+1) - 1. 

Using the resulrs of this example as a guide, it can 

be seen how the states may be generated sequentially from 

the information stored in KSTG, LSTG, NSTGl and NSTG2. 

First set the right subvector ar its first feasible value, 

the zero vector. The length of the left subvector is k = 

max(0,N'-L) since all N jobs are at the CPU. If k = 0 (i.e., 

if the CPU is an IS queue) , then there is one such state. 

(The left subvector can be filled in with N ones and N 

1 2 

twos in any order if an ISTATE vector representation is 

required.) If k > 0 , then leaving this right subvector 
fixed, generate successive states by successively setting 
the left subvector equal to the values indicated by the 
numbers stored in LSTG from component NSTG2(k) + 1 to 
component NSTG2(k+1). From this point on, the successive 
states are generated by setting the right subvector 
successively to the values indicated by the numbers stored 
in KSTG, for each right subvector, determining the length, 
k, of the left subvector and setting the left subvector 
successively to the values indicated by the numbers stored 
in the appropriate section of LSTG. Note that k need not be 
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recalculated each time the right subvector is changed since 
it will be constant for ail components of KSTG from 
ccmponent NSIGl(m) to component NSTG1(m+1) - 1, for m = 1, 

2/ •••/ N. 



Example 2 . 1 1 (b) 

The following table lists the value of the KSTG 
ccmponent used to derive the right subvector and the value 
of the LSIG ccmponent used to derive the left subvector for 
45 of the 18C states. 



NS 


KSTG 


LSTG 


NS KSTG 


LSTG 


NS 


KSTG 


LSTG 


1 


0 


112 


16 


10 


121 


80 


4 


222 1 


2 


0 


121 


21 


10 


222 


81 


13 


1122 


3 


0 


122 


22 


2 


112 


91 


22 


1122 


4 


0 


211 


28 


2 


222 


1 1 1 


40 


1122 


5 


0 


212 


35 


1 1 


222 


120 


40 


222 1 


6 


0 


221 


42 


20 


222 


121 


5 


11222 


7 


0 


222 


49 


3 


222 


122 


5 


12122 


8 


1 


112 


56 


12 


222 


123 


5 


12212 


9 


1 


121 


63 


21 


222 


130 


5 


2221 1 


10 


1 


122 


70 


30 


222 


140 


14 


2221 1 


1 1 


1 


211 


71 


4 


1122 


150 


23 


2221 1 


12 


1 


212 


72 


4 


1212 


170 


41 


2221 1 


13 


1 


221 


73 


4 


1221 


178 


50 


22112 


14 


1 


222 


74 


4 


1222 


179 


50 


22121 


15 


10 


112 


75 


4 


2112 


180 


50 


2221 1 




New, consider how 


to determine 


the 


state 


number fr 


the 


vector 


representation 


of a 


state . 


The 


right 


subvector 


can 


be translated 


into an 


integer and 


located in 


KSTG. Th 


search can 


be simplified 


by use 


1 of NSTG1. 


From 


the right 


sub vector 


can be determined the 


1 appropriate 


length, k, of 


the 


left s 


ubvector 


. The 


first 


k components 


of the ISTATE 



vector representation is the .left subvector. This can be 
translated into an integer value and located in LSTG. Use 
of NSTG2 will simplify this search. Unfortunately, a simple 
formula such as (2.1) cannot be applied at this point. In 
contrast to the key example, the number of states associated 
with each ccmponent of KSTG prior to the component under 
consideration may not be constant. However, this difficulty 
can be overccme since this number is constant within the 
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groups of KSIG. That is, the sum of the digits of a 
ccmponent of KSTG, say s, determines the length, Jc, of the 
left subvector which, in turn, determines the number of 
states associated with that component of KSIG. Thus, the 
same number of states is associated with each component of 
KSIG having the same value for s. An auxiliary vector, 
NSTG4, of length N can be used to store the total number of 
states associated with all components of KSTG prior to each 
group of ccmpcnents of KSTG. The information in NSTG1 and 
NSIG2 is all that is necessary to determine the values of 
the components of NSTG4. Algorithm 6 of Appendix B 
accomplishes this task. 

The state number can now be determined from the 
vector representation. Suppose that the sum of the 
components of the right subvector is s and the right 
subvector corresponds to KSTG ( j) , where j = 0 if s = 0 . (If 
s # 0, the search for j should take place between components 
NSIGI (s) and NSTG1(s+1) - 1 of KSTG.) From s we determine 
that the length of the left subvector is k = max(s,N-l,). 
Suppose that the left subvector corresponds to LSTG (i) , 
where i = 0 if k = 0. (If k # 0, the search for i should 
take place between components NSTG2(k) + 1 and NSTG2(k+1) of 
LSIG.) The state number is given by: 

state number =[j- NSTG 1 (s) ]x[ NSTG2 (k+ 1) - NSTG2 (k) ] 

+ [i - NSTG2 (k) ] + NSTG4(s) 

where NSTG1 (0) = 0, NSTG2 (0) = 0 and NSTG4 (0) = 1 if k = 0 
and NSTG4 (0) = 0 otherwise. 

Example 2 . 1 1 (c) 

Using Algorithm 6, NSTG4 = {7,21,42,70,120}. (Note 

that all of these states are in the table in part (b) of 



this example.) Consider calculation of the state number for 
ISTATE = {1, 2, 1, 2,2,3, 1} . Note that s = 3 + 1 = 4, NSIG1(4) 
= 10, k = max(4,5-2) = 4, NSTG2(5) = 17 and NSTG2(4) = 7. 

The right subvector corresponds to 31 = KSTG(13), so j = 13. 
Since k = 4, the left subvector is 1212 = LSTG(9), so i = 9. 
Finally, NSTG4 (4) = 70. So the state number is 

[13 - 10]x[17 - 7 ] + [ 9 - 7 ] + 70 = 102 

Now suppose that the multiserver gueue is one of the 
PP's rather than the CPU, Then all of the above can be used 
if the left subvector is redefined to consist of "those of 
the first N components of the ISTATE vector representation 
representing jobs not in service at PPj," where PPj is the 
multiserver gueue. 

A more difficult case is one in which two or mere 

processors are multiserver queues. For example, consider a 

case in which PP1 has L servers and the CPU has L servers 

1 C 

(both larger than cne) . Conforming to the above procedure, 

define the left subvector to consist of "those of the first 
N components of the ISTATE vector representation 
representing jobs not in service at either PP1 or the CPU." 

A new problem arises in this case. This problem is 
illustrated in the following example. 



Example 2.12 

Consider a case in which N =4, N =4, h=3 and PPl 

1 2 

and the CPU are IS queues. Then associated with a left 
subvector = {1, 1,2,2} and a right subvector = {2,3,1} are 
three states. Note that rwo I's and two 2‘s are distributed 
between PPl and the CPU, two jobs going to each processor. 
The three states arise by assigning both type one jobs, or 
both type two jobs, or one job of each type, to the CPU. 



Ill 



This problem may be handled by the addition of a 
single component on the right end of the left subvector 
specifying, for example, the number of type one jobs in 
service at PFI. However, since the maximum value of this 
added component depends upon the value of the first 
component of the right subvector, some of the resulting 
values in the appropriate section of LSTG may need to be 
bypassed for some right subvectors. This fact, along with 
the multiplicative effect that addition of such a component 
has on the length of LSTG and the fact that further 
components must be added as the number of multiserver queues 
increases, makes separate maintenance of this information a 
more attractive option. Such separate maintenance might 
involve creation of another storage vector, say MSTG, and an 
associated pointer vector, say NSTG5. The details of this 
procedure will not be discussed here. 

Note that this problem disappears if only one type 
of job can route to each multiserver PP. This is the 
subject cf the next subsection. 

3.5. A Sp ec ia liz ation: Zer o Bran chi ng Proba bilit ie s 

Consider a special case of the key example in which 
ALfA(1,1) = 0; i.e., only type-two jobs are allowed to route 
to PP1. As a result, the order of job types at PPl is not 
needed in specification of a state. In other words, nc 
information concerning PPl need be retained in the left 
subvector. The left subvector may be defined to consist of 
"those of the first N components not representing jobs at 
PPl." In such cases, variable length left subvectors as 
discussed in subsection 3.4 are quite useful. In fact, if 
only one type of job has zero branching probabilities , some 
of the procedures discussed above may be simplified. As an 
alternative, the normal procedures for the key example as 
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1 ^ 











discussed in subsection 3.1 and 3.2 could be followed 
ignoring the fact that many of the represented states are 
invalid. The final steady-state probability distribution 
will assign zero probability to each of the invalid states. 

As a complication, suppose (in addition to ALFA (1,1) 
= 0) that the CPU is a multiserver queue. Then a subtle 
problem 'arises. Consider the following example. 



Example 2.13 

Suppose ALFA (1,1) = 0 and the CPU is an IS queue. 

Consider using the ISTATEmI techniques discussed earlier in 

this section. Defining the left subvector to consist of 

those of the first N components of the ISTATE vector 

representation not representing jobs at PP1 or jobs at the 

CPC, the length of the left subvector is the sum of the 

components of the right subvector less the first component. 

Suppose N = 4, N =3 and M = 3. Then the portion of LSIG 
1 2 

containing four-digit numbers is: 

{1 1 11 , 1 112 < 1 121 , 1 122, 12 11 , 1 212 , 1221 , 1222 , 2111 ,21 12 , 
2121,2 122 , 2211 , 2212 , 2221 } 

Each of these components can be used to generate state 
vectors if the right subvector is {0,3,1}. However, 
consider the right subvector {3,3,1}. The length of the 
appropriate left subvectors is again four. But, the fact 
that three of the four type-two jobs must be located at PP 1 
forces frcm consideration any left subvector containing more 
than one 2. Thus, the components of LSTG usable in this 
case are; 

{ 1111 , 1112 , 1121 , 1211 , 2111 } 

Note that it is necessary to skip around in LSTG. 

To take care of this problem, another vector 
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representaticn, hereafter called the ISTATEm2 vector 
representaticn , has been developed. Although its 
application is more important for cases such as that 
discussed in Example 2.13, the ISTATEm2 vector 
representation is introduced here in terms of the key 
example sc that comparison with the I3TATE and ISTATEml 
vector representations may be clearly seen. 

The ISIATEm2 vector representation of the states of 
the key example is a vector of length N + M + 2 whose first 
component is the sum of tiie components of the right 
subvector, whose nexr a components form the right subvector, 
whose next component is the total number of type-two jobs at 
the PP's, and whose last N components form the left 
sufcvector. The constraints used in the lexicographic 
ordering procedure for these vectors are: 



IS 

Hule 1; The 
integer, k, n 
Hule 2: The 

integers whos 
Hule 3: The 

integer which 



IATSm2 Sequencing Constraints 
first component is a nonnegative 
o larger than N. 

next a components are nonnegative 

e sum is the first component. 

next component is a nonnegative 

can be no smaller than k - N and 

1 



no 



larger than the smaller of k and N . 

2 

Rule 4: The last N components consist of N ones 

1 

and N twos. The first k of these contain exactly 
2 

the number of twos specified by the (M + 2)-nd 
component (see Hule 3) . 
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Example 2.14 



Again consider the key example wirh N = 2, N =2 and 

1 2 

U = 2. (See Examples 2.7 and 2.8.) The following table 
shews 13 of the 90 states with their state numbers as 
provided by a lexicographic ordering of their ISTATEra2 
vector f epresentatiens. For reference purposes all three 
vector representations discussed thus far are listed for 



each state. 






NS 


ISTATEm2 


ISTATE 


ISTATEmI 


1 1 


[0,0,0,0,1,1,2,21 




[1,1,2,2,0,01 




[0,0,0, 1, 1,2,2} 


35 1 


2, 2, 0,1, 2, 1,2,1 




2, 1,2,1, 2,0 




2, 2, 0,2, 1,2,1} 


36 i 


2, 2, 0,2, 2, 2, 1,1 




2,2,r,i,2;o- 




■2,2,0,2,2,i;r 


37 1 


3, 0,3, 1,1 ,1,2,2 




1,1,2, 2,0, 3 




3, 0,3,1, 1,2,2} 


38 1 


3,0,3,1,1,2, 1,2; 




1,2,1,2,0,31 




■3,0,3, 1,2,1 ,2} 


39 1 


3, 0,3, 1,2, 1,1,2 




2, 1,1, 2, 0,3 




3, 0,3, 2, 1,1,2} 


40 


3, 0,3, 2, 1,2, 2,1 


; 


1,2,2, 1,0,3 




3, 0,3, 1,2, 2, r 


41 ) 


3, 0,3, 2, 2, 1,2,1 




2,1,2,1,0,31 




3, 0,3, 2, 1,2,1 


42 


3, 0,3, 2, 2, 2, 1,1 




2,2,1,1,0,33 




3,0,3,2,2,1,11 


43 ] 


3, 1,2, 1,1, 1,2, 2 




1,1, 2, 2, 1,2 




3, 1,2,1, 1,2,2} 


45 


3, 1,2, 1,2,1, 1,2 


’ 


2,1, 1,2,1, 2 




3, 1,2,2, 1,1 ,2} 


46 


3,1 ,2, 2, 1,2, 2,1 




1,2,2, 1, 1,2 




3,1,2,1,2,2,r 


90 


4,4,0,2,2,2,1,11 




2, 2, 1,1, 4,0; 




4, 4, 0,2, 2, 1,1; 



In this example there is little difference between 
the order of the states when the sequencing procedure is 
based on the ISTATEmI vecror representation and when it is 
based on the ISTATEm2 vector representation. An occasional 
pair of states (such as states 39 and 40) are reversed in 
order. In the following example the difference is more 
pronounced. 



Example 2.15 

Consider the key example in which N = 4, N =3 and H 

12 

= 2. The following table shows 24 of the 1260 states with 
their state numbers as provided by a lexicographic ordering 
of their ISTATEm2 vector representations. Note, for 
example, that for state number 561, the first component of 
the ISTATEm2 vector representation indicates that there are 
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five jobs at the PP's. The next two components indicate 
that one cf these is at PP1 and the remaining four are at 
PP2. The fourth component indicates that only one of the 
five is a type-two job. The remainder of the vector 
indicates that the job at ?P1 is a type-one job, the 
type-two job is the one in service at PP2, and two type-two 
jots are at the CPU, 



NS 






ISTATEm2 


525 1 




,4 


,0,3, 2, 2,2 


,1/1, 1/1} 


526 


5 


/O 


,5, 1, 1,1, 1 


/1/2,2,2} 


527 I 


5 


/O 


/5, 1, 1, 1, 1 


/2, 1,2,2' 


528 


’5 


/O 


,5, 1,1, 1,2 


/ 1/ 1/2,2' 


529 


5 


/O 


/5, 1,1,2, 1 


/ 1/ 1/2,2' 


530 i 


‘5 


/O 


,5,1,2, 1, 1 


/ 1/ 1/2,2 


531 1 


'5 


/O 


,5,2, 1, 1, 1 


/2,2,1 ,2| 


53 2 ] 


’5 


/O 


,5,2, 1,1,1 


,2,2,2, 1' 


533 1 


'5 


/O 


,5,2,1, 1,2 


/1/2,1,2' 


53 4 


’5 


/O 


,5,2, 1,1, 2 


/ 1/2,2, 1' 


535 i 


'5 


/O 


,5, 2,1, 1,2 


/2,1,1,2' 


536 


‘5 


fO 


,5, 2, 1, 1,2 


/2, 1,2, 1} 


537 i 


;5 


/O 


,5, 2, 1,2,1 


/ 1/2,1 ,2' 


538 i 


5 


fO 


,5, 2, 1,2, 1 


/ 1/2,2, 1 


539 j 


'5 


/O 


,5, 2, 1,2,1 


/2' 1,1 ,2j 


541 i 


‘5 


,0 


,5,2, 1,2,2 


/ 1/ 1, 1/2' 


543 1 


’5 


/O 


,5, 2,2, 1, 1 


/ 1/2,1 ,2' 


549 


'5 


,0 


,5, 2,2,2, 1 


/ 1/ 1/1/2' 


550 i 


5 


/O 


,5, 2,2,2, 1 


/I, 1,2,1' 


551 1 


'5 


/O 


,5,3, 1, 1,2 


/2,2,1, r 


552 1 


'5 


,0 


,5 , 5, 1 , 2, 1 


/ 2 , 2 , 1 ,r 


553 \ 


^5 


fO 


,5, 3, 1,2, 2 


/ 1/2, 1, 1 


560 


[5 


,0 


,5, 3, 2,2,2 


/1/1/1/1' 


56 1 1 


5 


, 1 


,4, 1,1,1, 1 


/ 1/2,2, 2' 



Despite the introduction above, the ISTATEm2 vector 
representation is not particularly useful for the key 
example. It is more useful in cases in which rhe left 
subvector has variable length. 

Consider the case in which ALFA (1,1) = 0 and the CPU 
is a multiserver gueue with L > 2 servers. The left 
subvector contains the job types at PP2, ... , PPM and 
engueued at the CPU. The first M + 1 components of the 
ISTATEm2 vector representation of states of the system are 
defined as above. Redefine the next componenr to be the 
total number of type-two jobs at PP2, ..., PPM and enqueued 
at the CPU, cr equivalently, the total number of twos in the 
left subvector. This is followed by the left subvector and 
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whatever zeros are required to complete the N + M + 2 
components. Rules 3 and 4 must be changed to conform to 
this redefinition. The fact that only type-two jobs can 
route to forces a change in Rule 2 also. The complete 

set of rules is; 



ISIATEffl2 Sequencing Constraints 
Rule 1; The first component, k, is a nonnegative 
integer no larger than N. 

Rule 2; The next M components are nonnegative 

integers, the first of which is no larger than N . 

2 

The sum of these components must be k. 

Rule 3; The next component is a nonnegative 

integer which can be no smaller than the larger of 

k - n - N and N - n - L, and which can be no 
11 2 1 

larger than the smaller of N - n and m, where n 

2 1 1 

is the second component (i. e., the number of jobs 

at Pfi) and m = k - n + max(0,H-k-L) is the 

1 

length cf the left subvector. 

Rule 4: The next m components consist solely of 

ones and twos. The number of twos among the m 
components is the value of the (M + 2)-nd 
component (see Rule 3) . The last N - m components 
are all zeros. 

As can be deduced from our willingness to change the 
description cf a vector representation to meet specific 
situations without changing the name of the representation, 
the name cf a vector representation does not denote a 
specific set of rules. Rather, it denotes a type of 
ordering of the states. For the ISTATE vector 
representation, rhe right subvector is varied for each value 
of the left subvector. For the ISTATEmI and ISTATEm2 vector 



representations, the left subvector is varied for each value 
of the right subvector. For the ISTATSmI vector 
representation, the appropriate left subvectors are arranged 
in lexicographic order for each right subvector. As seen in 
Example 2.13, this arrangement is not convenient for some 
applications. The following example shows the I3TATEm2 
arrangement of left subvectors for each right subvector in a 
specific group for the case discussed in Example 2.15. 



Example 2.16 

Consider again the case in which ALFA (1,1) = 0, the 

CEO is an IS queue, H = 4, N =3 and M = 3. For the case 

1 2 

of lexicographic sequencing according to the ISTATEm2 vector 

representation, the four-digit components of LSTG are: 

{1 1 11 , 1 112 , 1 1 21 , 121 1,21 1 1 , 1 122 , 1212 , 1 221 , 2112 , 2121 , 
2211 , 1222 , 2122 , 2212 , 2221 } 

Note that for the right subvector {0,3,1} each of these 
components of LSTG will provide us with a valid state. For 
the right sutvector {2,3,1}, only the first five components 
of LSTG will yield valid states. The advantage of the 
ISTATEm2 vector representation is that these five components 
are in consecutive storage locations. Compare this with the 
result of Example 2.13. With right subvector {1,3,1} we 
associate the first eleven components of LSTG and with 
{3,3,1}, only the first one. 

From Example 2.16 it is apparent that procedures 
are needed for storing the numbers of LSTG in the 
appropriate order and for efficient determination of which 
components of LSTG to use with each right subvector. For 
storage of the numbers in LSTG according to a lexicographic 
ordering of the ISTATEm2 vectors, a modification of the 
algorithm given for the ISTATEmI case is possible. 



Algorithm 7 c£ Appendix B provides an alternative to such a 
modification. First, note that because of the way LSTG is 
used, it does not matter whether all of the one-digit 
numbers are followed by the two-digit numbers, and so forth 
up to the N-digit numbers, or the reverse, or any other 
arrangement. It is simply necessary to be able to quickly 
determine the beginning and end of each group. In view of 
Example 2. 16 it is apparent that determination of where the 
number of ones (or twos) changes, even if the number of 
digits does not, is also required. In Algorithm 7 the 
components of LSTG are arranged so that the numbers with the 
largest number of digits are first and the smallest last. 
NSTG3 is used to store pointers to the locations in LSTG 
where either the number of digits or the number of ones 
changes. NSIG2 is used to store pointers to the locations 
in NSTG3 where pointers to a change of digits are stored, 
fiecall that the case under consideration is the key example 
except that ALFA (1,1) = 0 and the CPU is a multiserver queue 

with L servers. In particular, all processors except PP1 
are assumed to accept both types of jobs. 

Example 2 . 17 (a) 

For N = 2, N =2 and L > 2, Algorithm 7 yields: 

1 2 

LSTG = (1122,1212,1221,2112,2121,2211,112,121,211, 
122,212,221,1 1 , 12,21,22, 1 ,2J 
NSTG3 = {1,7,10,13,14,16,17,18,19} 

NSTG2 = {7 ,4, 2,1} 

Note that for j = 1,2,.. .,8, all numbers in LSTG from 
component NSTG3(j) through NSIG3(j + 1) - 1 have the same 

numbers of digits and the same number of ones. Furthermore, 
for i = 1,2, 5, 4, LSTG (NSTG3 (NSTG2 (i) ) ) is the first i-digit 
number stored in LSTG. 

Cnee LSTG, NSTG2 and NSTG3 have been developed using 
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Algorithm 1 , the components of a right subvector contain the 
informaticD required to determine which components of ISTG 
contain valid left subvectors for that right subvector. For 
example, if n^ is the first component of the right subvector 

(i.e., the number of jobs at PP1) and k is the sum of its 

components, then m = k - n + max(0,N-k-L) is the length of 

1 

each valid left subvector, j = min(m,N ) is the largest 

1 

feasible number of ones, and i = max(0,m-N +n ) is the 

2 1 

smallest feasible number of ones. Note that, once m has 

been determined, j is independent of n . Thus, the valid 

1 

left subvectcrs begin in component NSTG3 {NSTG2 (m) ) of LSTG 
and run through component NSTG3 (NSTG2 ( m) + j - i) - 1. 

Sc far the fact that Rule 2 restricts the value of 

the first component of a right subvector to be no larger 

than N has been neglected. As a result Algorithm 3 fcr 
2 

cooputat icnal generation of KSTG and N3TG1 must be revised. 

Briefly, if the sum of the components of the right subvector 

is N + d, then the number of components in KSTG 
2 

corresponding to this sum of components must be reduced by 

C . This correction can be made easily in Algorithm 

M+d-2,d-1 

3. 



Example 2.17 (b) 

If M = 3, the modified algorithm for right subvectors 
yields : 
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KSTG 



= {1, 10^ 100, 2, 1 1^20, 1 Cl , 110, 200,3, 12,21, 30, 
102,111,120,201,210,4,13,22,31,40, 103, 112, 

121, 130,202,211 ,220} 

and 

NSTG1 = {1,4, 10, 19,31} 

,At least two options are available for calculation 
of state numbers from vector representations. A vector 
NSIG4 which has the same length as KSTG could be used to 
store, for example, in NSTG4 (i) , the number of states prior 
to the first state using the corresponding right subvector, 
KSTG (i) . Given a state vector in this case, the length, m, 
of the left subvector, the value of i such that the right 
suhvector is stored in KSTG (i) and the value of j such that 
the left subvector is stored in LSTG(j) would be determined. 
The state number is then 

state number = NSTG4 (i) + j + 1 - NSTG3 (NSTG2 (m) ) 

As an alternative NSTG4 could be shortened by using 
the fact that the number of states associated with each 
right suhvector changes when either the sum of the 
components cf the right subvector or the value of its first 
component changes, A vector, say NSTG5, could then be used 
to point to the components of KSTG where either the first 
component or the sum of the components of the stored right 
sutvectors changes. NSTGI would be modified to point to 
NSTG5 the same way that NSTG2 points to NSTG3. NSTG4(i) 
would contain the number of states prior to the first state 
using the right subvector stored in KSTG (NSTG5 (i) ) . Thus, 
the length of NSTG4 would be the same as that of the new 
vector NSTG5, making this procedure less efficient (in terms 
of storage r eguire ments) unless the length of NSTG4 was cut 
at least in half. In addition, the work required to 
calculate the state number would be increased. The equation 
to be used is: 
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state number = NSTG4(i) + [n + 1 - NSTG3 (NSTG2 (m) ) ] 

+ [j - NSTG5 (i) ]X[ NSTG3 (k+1) - N STG3 (NSTG2 (m) ) ] 

where j is the integer such that the right subvector is 
stored in KSIG ( j) , i is the largest integer such that 
NSIG5 (i) , < j , n is the integer such that the left subvector 
is stored in ISTG (n) , m is the length of the left subvector, 
and k is the largest integer such that LSTG (NSTG3 ( k) ) is a 
legal left subvectcr for use with the given right subvector. 

3.6, Gen eral iz at ion of Queuing Di s cip line 

ICFS, LCFSPR, LCFSPRpt: The state vectors are the 

same as in the FCFS case. Only the interpretation differs. 
Thus, unless the service distributions are ncnexponent ial, 
or blocking cr bulking is possible, storage and sequencing 
of states and determination of state number from vector 
representation is as discussed in the preceding subsections. 

Preemptive Priority: The order of job types at any 

processor having this queuing discipline need not be 
considered a part of the left subvector. What is needed is 
the number of each type of job at each of these processors. 
Hence, cases such as these can be handled the same way IS 
queues are handled. 

Ncnpreemptive Priority: In addition ro the 

information required for preemptive priority queues, the 
type of job in service is required. This may be handled by 
treating nonpreemptive priority queues the same way 
preemptive priority queues are handled but including the job 
type of the job in service in the left subvector. A 
complication arises in this case which does nor arise in rhe 
preemptive priority case if the queue is a multiserver queue 
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as well. The types of jobs in service cannot be added ro 
the left sub vector since their order is immaterial. 

Instead, the number of type one jobs, for example, in 
service at each such queue should be appended to the 
subvector which contains information about the number of 
each type of job at each priority queue. 

PS; In effect a PS queue is an IS queue with 
state-dependent service rates. PS queues may be handled in 
exactly the same way that IS queues are handled. 



3.7. Gene ral ization of Service Distri butions 

Unfortunately, che relatively compact storage 
schemes and elegant procedures for sequential generation of 
the states and for determination of state numbers from 
vector representations presented earlier in this section 
extend to cnly a few specific cases when nonexponential 
service distributions are introduced. In this subsection a 
procedure for handling many general situations will be 
suggested. 

Consider a model in which there are generalized 
Erlangian service distributions such as those discussed in 
subsection 4.1 of Chapter I. Then, an array NUM must be 
maintained as a part of the model description. NUM(i,j) 
should be the number of stages associated with the service 
distribution of type-i jobs at processor j. In order to 
completely describe a state of the system, a number, say K, 
of components must be appended to any vector representation 
to contain information concerning stages of service. 
(Depending upon the situation, K may vary from one to 2 x 
N.) Hereafter this collection of K components will be 
referred to as the far subvector. In all three vector 
representations discussed above, the far subvector can be 
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attached to the right end and rules to govern the selection 
of values for its coapcnents can be added. 



Example 2.18 

Consider the case in which each processor is a FCFS, 
single server queue with generalized Erlangian service 
distributions. Then the far subvector is of length a + 1. 
The j-th component of the far subvector contains the stage 
of service f cr the job in service at processor j. The 
convention will be to assign the value of one to each 
component which corresponds to an idle processor. For a 
lexicographic ordering of the ISTATS vectors for this 
system, the following rules should be added to the two rules 
in subvector 3.1: 

Rule 3: The next M components are positive 

integers such that the j-th of these is no larger 
than one if ISTATE(N+j) = 0, and otherwise is no 
larger than NUM(i,j), where i is the value of 
component [ISTATE(N + 1) + ISTATE (N+2) + ... + 

ISTAIE(N + j)] of ISTATE. 

Rule 4: The last component is a positive integer 
which is no larger than one if the sum of the 
components of the right subvector is N, and 
otherwise is no larger than NUM(i,M+1) , where i = 

ISTATE (K) . 

Since the far subvector is at the right end of any 
vector representation of the states, a choice of left and 
right subvectors leads to a group of states. Thus, to 
sequentially generate all of the states, it is necessary 
only to sequentially generate the left and right subvectors 
as described in the preceding subsections and, each tiie 
either the left or the right subvector changes, generate all 
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applicable far subvectors. The right and left subvectors 
(and associated pointers) can be stored as described in the 
preceding subsections according to the sequencing procedure 
used. The mcdus operand! will be to not store the far 
subvector. Each time the states are (sequentially) 
considered, generation of the associated far subvectors may 
be accomplished by performing the lexicographic sequencing 
procedure (see subsection 3.1) using constraints similar to 
Rules 3 and h in Example 2.18. 

The remaining question is how to determine state 
numbers from vector representations. (ISTATEmI or ISTATEm2 
sequencing will be assumed.) To aid in this effort create a 
vector, NSTG4, having the same length as KSTG. NSTG4(i) 
will be the number of states preceding the first state 
having KSIG (i) as its right subvector. Now suppose a right 
subvector and a left subvector which is valid for the right 
subvector are given. Create a vector, MAX, of length K (the 
same length as the far subvector) sucn that MAX (k) is the 
largest value the k-th component of the far subvector can 
take on subject to the constraints imposed by the given 
right and left subvectors. In many models the components of 
the far subvector can take on values independently of each 
other. That is, the values that any given component can 
take on are not restricted in any way by the values the 
other components have. For these models, each vector from 
the vector of all ones through MAX is a feasible value for 
the far subvector. (If A and 3 are vectors of the same 
length, A < E means that each component of A is less than or 
equal to the corresponding component of B. This last 
sentence means: vector of all ones < A < MAX implies that A 
is a valid fax subvectcr for the given right and left 
subvectors. Ey construction of MAX and convention for far 
subvectors, the reverse implication is also true.) Thus, 
the number of states having the given right and left 
subvectors is the product of the components of MAX. 
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Given a particular far subvector, MAX may be used 
also to indicate its relative position among all far 
sutvectors which are valid for the given right and left 
subvectors. Algorithm 8 of Appendix 3 specifies how this may 
be done. Algorithm 9 in the same appendix may be used to 
determine the state number of a state from its vector 
jrepresentat icn for models like those discussed in the 
preceding paragraph. 

for cases in which the components of the far 
sutvector are not independent of each other, decompose the 
far subvector into subvectors such that the components are 
independent of each other within each subvector but 
components of all subvectors but the first depend upon the 
values in the preceding subvector (s) . Then attempt to 
iterate the above process. This, of course, is a very 
complicated procedure at best. 

3.8. Fini te Capacities and Block ing 

The model considered in this subsection is the key 
example except that the capacity of the CPU is C jobs, where 
C is a positive integer smaller than the total number, N, of 
jobs circuiating in the system. (If C > N, the CPU 
effectively has infinite capacity since no blocking can 
occur.) Other possible models are left to the reader. 

Since the CPU has a finite capacity, the potential exists 
for blocking to occur at any of the PP's. As discussed in 
subsection 2.6, M components will be added to the vector 
representation of a state, one for each PP to indicate its 
blccked/nct blocked status. Hereafter these M components 
are called the blocking subvector. 

One way to handle this situation is to set a 
component of the blocking subvector equal to two if the 
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corresponding EP is blocked and one if it is not, and use 
the procedures discussed in the last subsection. However, 
since the possible values of the components are independent 
of the left subvector, a more efficient scheme is possible. 

Using the terminolgy of subsection 2.6, assume that 
a priority cr probablistic scheme is being used to determine 
which of the blocked PP's will be unblocked when a service 
ccmpleticn takes place at the CPU. In particular, set the 
j-th compcnent of the blocking subvector egual to one if PPj 
is blocked and zero if not. 

The ISIATSml and ISTATEm2 vector representations 
(with the blocking subvector at the right side) are 
preferred over the I3TATE vector representation for twc 
reasons. First, once the right subvector has been 
specified, the whole range of possible blocking subvectors 
can be determined. Second, the fact that the CPU has a 
finite capacity implies that there is a positive minimum 
feasible value for the sum of the components of the right 
subvector, i.e., the first component of the ISTATEmI and 
ISIATEm2 vector representations. 

Suppose that a right subvector is specified. If the 
sum of its components is greater than N - C, no blocking can 
occur. Sc consider rhe case in which this sum equals S - C. 
If the j-th ccmponent of the right subvector is zero, so is 
the j-th compcnent of the blocking subvecror since an idle 
processor cannot he blocked. Removing these components from 
the blocking subvector, we are left with a blocking 
subvector whose length is the number of nonzero terms of the 
specified right subvector. Each component of this shortened 
blocking subvector can independently take on either of the 
values zero cr one. Using the techniques for storage and 
handling of variable-length vectors discussed in subsections 
3.4 and 3.5, these shortened blocking subvecrors could be 
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stored for later use. Indeed, this is necessary if a FIFO 
or LIFO unblocking scheme is used. But in the present case 
the fact that the stored numbers are binary can be used to 
avoid storing any information at all concerning the blocking 
subvectors. If the right subvector has m nonzero 

m 

components, there are 2 feasible (shortened) blocking 
subvectors, and they are represented by binary 

m 

representations of the integers from zero through 2 - 1. 



Example 2.19 



Suppose N 




4 and C 



3. Consider the 



right subvector {2, 1,0,1}. Since the sum of the components 
of the right subvector is equal to N - C, the blocking 
subvector may take on nonrrivial values. The shortened 
blocking subvector has three componenrs whose values 
indicate whether or not PP1, PP2, and PP4, respectively, are 
blocked. A value {1,0,1} indicates that PP1 and PP4 are 
blocked but PP2 is not. The following table lists the 
feasible (full) blocking subvectors in lexicographic order 
and the number which would be stored to represent each one. 

clocking stored 

subvector number 



0 , 0 , 0,01 

0 , 1 , 0, 0 
0 , 1 , 0 , 1 

1 , 0 , 0, 0 

1 , 0 , 0 , 11 

1,1 o;oj 

1 , 1 , 0 , 1 } 



0 

1 

10 
1 1 
100 
101 
110 
11 1 



Note that the third component of the blocking subvector is 
zero since tte third component of the right subvector is 
zero, i.e., PP3 cannot, be blocked since it is idle. Also 



3 

note that there are 2 =3 feasible blocking subvectcrs and 



that their corresponding stored numbers are the binary 
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3 

representations of the integers 0,1, 2,. ..,7 =2 - 1. This 

is because three is the number of nonzero components in the 
right sufcvectcr. 

As remarked above, the blocking subvectors need not 
be stored. In fact, since the total number of jobs at the 
PP's may’ not te smaller than N - C, the size of KSTG can 
usually be reduced. Sequential generation of the states is 
accomplished by successively setting the right subvectcr to 
the values indicated in KSTG, for each right subvector 
successively setting the left subvector to the feasible 
values from LSTG, and for each left subvector successively 
setting the blocking subvector to its feasible values. This 
last step is accomplished (in those cases in which blocking 
is possible) by successively setting the blocking subvector 
equal to the binary representation of the integers from zero 
m 

to (2 - 1) where m is the largest feasible number of 

blocked PP's. 



Various options are available for determining state 
numbers from vector representations. First, note that the 
number of states corresponding to a given right subvector is 
equal to the number of left subvectors valid for that right 
subvector if the sun of the components of the right 

m 

subvector is larger than N - C; it is equal to 2 times the 

number of valid left subvectors (where n is the number of 
nonzero components in the right subvector) if this sum 
equals N - C. 



One procedure would be to store in NSTG4 (i) the 
number of states preceding the first state having KSTG (i) as 
its right subvector. A straightforward algorithmic 
procedure (see Algorithm 10 in Appendix B) could then he 
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used to determine the state number from the vector 
representation. Of course, NSTG4 can be shortened since the 
number of states having a given right subvector can be 
determined from NSTG2 and NSTG3 if the sum of the compcnents 
of the right subvector is larger than N - C. It can be 
shortened further if the states are seguenced according to a 
lexicographic ordering of a new vector representation, say 
ISIATEmS', which is similar to the ISTATEm2 (or ISTATEmI) 
vector representation except that the number cf nonzero 
components of the right subvector is inserted (as an 
additional component) between the first two components, 
i.e., betweer the sum of the components of the right 
sutvectcr and the right subvector itself. Since the 
techingues that would be used in this case are similar to 
those developed earlier in this secrion, we will not discuss 
the ISTAIEii3 vector representation further. 



3.9. Bu lking 



A review of subsection 2.7 reveals that the possible 
values of components add to the vector representation to 
handle bulking depends upon both right and left subvectors. 
Hence, the procedures reguired for seguential generation of 
the states and determination of state numbers from vector 
representations are those discussed in subsection 3.7 for 
handling nonexponential service distributions. 

4. GENE3ATICN AND STORAGE OF THE BALANCE EQUATIONS 



Once the states have been appropriately seguenced and an 
efficient computational procedure has been developed to 
translate state vectors into state numbers, the balance 
equations may be generated and stored for subsequent 
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solution, for i = 1, 2, NSTATB, let p be the 

i 

steady-state probability that the system is in state i. 

Then the i-th balance equation has the form 

\ 

(2.2) (rate of transition from srate i) p = 

i 

J (rate of transition from j to i) p 

j 

Dividing (2.2) through by (rate of transition from state i) , 
the balance equations can be written in more compact matrix 
form as 

(2.3) P = AP 

where P is the column vector (p /P ,...,p ) and A is 

1 2 NSTATE 

the matrix with (rate of transition from j to i) divided by 

(rate of transition from i) at the intersection of the i-th 
row and j-th column. 

As will be seen shortly, A is often a very sparse matrix 
(i.e., a large proportion cf its elements are zeros). 

Hence, it is inefficient to store A in matrix form. Rather 
it is suggested that only its nonzero elements and the 
location cf each be stored. This is accomplished by storing 

(i) the k-th nonzero element in COEF(k); 

(ii) the column number for the k-th nonzero 
element in INDEX (k) ; and 

(iii) the number of nonzero elements in the first 
i rows in NCON (i) . 

Thus, equivalent to (2.2) is 
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(2.4) 



F. 

1 



2 COEF ( j) p 



INDEX (j) 



j = HN 



where NN = NCCN(i-l) + 1 (MN = 1 if i = 1) and MX = NCCN(i). 

Generaticn of the balance equations now involves filling 
in the three arrays COEF, INDEX and NCON. To do this the 
following must be determined for each state i: 

(i) the rate cf transition from state i; 

(ii) the various states from which the system can 
transition (in one step) into state i; and 

(iii) the rate of transition from each cf these 
states into state i. 

The rate cf transition from state i (hereafter called 
DIVRAT) is the sum of the service rates of the jobs in 
service at the various processors. The states from which 
the system can transition into state i may be determined 
from the ISTAIE vector representation of state i by 
determining which job (or jobs) could have moved in such a 
transition. The rate of transition in each such case is the 
service rate cf the job which moved. 

4.1. Key Ex am ple 



be a candidate for movement during a transition into state 
i, it must be the leftmost job in the ISTATE listing of jobs 
at some processor. 

Example 2.20 

Let N = 3, N =4, and M = 2. For state i suppose 



Since all processors are FCFS, in order for a job to 



1 



2 
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ISIATE = {1 , 1 ,2 ,2 / 1 / 2 , 2,3 , 3} . Then there is a type-two job 

in service at each of the three processors. Hence, DIVRAT = 
RATE (2,1) + FATE (2,2) + RATE (2,3). (See subsection 1.1 for 

definiticn of RATE and ALFA.) There are four states from 
which the system could transition into state i: 

(i) Frcm state j with ISTATE = 

1 

{1 , 1,2, 2, 1 , 2, 2, 3 , 4} by a type-two job at PP2 

completing service and preceding to the CPU. Rate 
of transition from j^ to i is RATE(2,2). 

(ii) Frcm state j with ISTATE = 

2 

{1 , 1 ,2, 2, 2, 1 , 2, 4 , 3} by a type-two job at PP1 



completing service and preceding to the CPU. Rate 
of transition from j^ to i is RATE (2,1). 

(iii) Frcm state j^ with ISTATE = 

{1 , 1, 2, 1, 2, 2, 2, 3, 2} by a type-two job at the CPU 



completing service and preceding to P?2. Rate of 
transition from j^ to i is RATE (2 , 3) ALFA (2, 2) . 

(iv) Frcm state j with ISTATE = 

4 

{ 1 ,2 ,2, 1 ,2, 2 , 1 , 2 , 3} by a type-one job at the CPU 



completing service and preceding to PP1. Rate of 

transition from j to i is RATE (1 ,3) ALFA (1 , 1) . 

4 

Letting HN = NCON(i-l) + 1, set: 

COEF(llil) = RATE (2,2)/DIVRAT INDEX (MN) 

C0EF(MN + 1) = RATE (2,1) /DIVRAT INDEX(MN+1) 

C0EF(MN + 2) = RATE (2 ,3) ALFA (2,2) /DIVRAT INDSX(!lN + 2) 

C0EF(MN+3) = RATE (1 ,3) ALFA (1, 1) /DIVRAT INDSX(MN + 3) 



= j 
= j 
= j 
= j 



1 

2 

3 

4 
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NCON(i) = MN + 3 

Note that the actual values of i, j , i , j and j may be 

12 3 4 

derived from (2,1) and the respective ISTATE vector 

representations. This example should be used as a guide in 
understanding the following examples which develop the 
balance eguation for a similar state i for various 
generalizations of the key example. 



4.2. Gene ralization of the Number of Jo b T vpe s 

Exactly the same procedures are followed if the 
number of job types is increased beyond two. 



4.3. Gener ali z^i on ^ the Nu®^^ Qt Ser vers 



Example 2.21 

Consider the same situation as Example 2.20 except 
that PP1 is a multiserver queue with two servers. Then 
since two jots are in service at PPI, DIVRAT has a different 
value: 

DIVRAT = RATE(1,1) + RATE(2,1) + RATE(2,2) + RATE(2,3) 

There will aJso be some change in the balance equation. Of 
the four terms listed in Example 2.20, all but (ii) are 
still valid. In the present case it is possible to 
transition into state i from either of the two states by 
having a type-two job complete service at PP1 and route to 
the CPU. Thus, (ii) should be replaced by: 

(ii) from state j wirh ISTATE = 

2 

{1 , 1,2, 2, 2, 1 , 2, 4, 3) by a type-two job at PP1 

completing service and preceding to the CPU. Rate 
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of transition from j to i is RATE(2,1)x2. 

2 

(ii ' ) From state i with ISTATE = 

2 * 

{1 ,2, 1 , 2,2, 1 , 2, 4, 3} by a type-two job at PP1 

completing service and preceding to the CPU. Rate 

of transition from j to i is RATE(2,1). 

2 ' 

4.4. Ge ne ralization of Qu euing Discip line 



With the exception of PS a change of queuing 
discipline at one or more of the processors would not effect 
the value cf EIVRAI. However, any such change will cause 
changes in the balance equations. In the following examples 
the same situation as that examined in Example 2.20 is 
considered with the exception of the queuing discipline 
change noted in each case. The resulting changes to the 
balance equation fer state i are listed. 



Example 2.22 

Suppose that the queuing discipline at PP2 is LCFS. 
Then any job arriving at PP2 (from the CPU) will join the 
front of the line rather than the back as in the FCFS 
discipline. Thus (iii) should be changed to: 

(iii) From state j^ with ISTATE = 

{1, 1,2, 2,2, 2, 1, 3, 2} by a type-one job at the CPU 

completing service and preceding to PP2. Rate of 
transition from j to i is RATS (1 ,3) ALFA (1 , 2) . 
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Example 2.23 



Suppose that type-one jobs have (nonpreempti ve) 
priority over type-two jobs at PP2. Then, if a type-one job 
were to complete service at the CPU and precede to PP2 at a 
time when one or more type-two jobs were enqueued at PP2, 
the type-cne job would enter the line in front of all 
(enqueued) type-two jobs. Thus, in addition to the four 
terms listed in Example 2,20, the balance equation in this 
case would have: 

(V) Frctt state j^ with ISTATZ = 

{1 , 1 , 2 , 2 ,2,2 , 1 , 3 , 2} by a type-one job at the CPU 

completing service and preceding to PP2. Rate of 
transition from j^ to i is RATE ( 1 ,3) ALFA (1 , 2) . 



Example 2.24 

Suppose that type-two joos have preemptive priority 
over type-one jobs at PPl. Then, though a newly arriving 
type-two job (at PPI) would not interrupt another type-two 
job in service, it would interrupt a type-one job in 
service. Hence, tc the four terms listed in Example 2.20 
add ; 

(V) From state j^ with ISTATE = 

{1 / 1#2, 1,2,2,2,2,3} by a type-two job at the CPU 

completing service and preceding to PPI. Rate of 

transiticn from j to i is RATE (2 ,3) ALFA (2 , 1 ) . 

5 

Example 2.25 

Suppose that the queuing discipline at PP2 is LCFSPR 
or LCFSPSpt. Then, in contrast to Example 2.22, (iii) of 
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Example 2.20 must be changed to; 

(iii) From state with I5TATE = 

{1,1,2,2,1,2,2,3,23 by a type-two job at the CPU 

completing service and preceding to PP2. Rate of 
transition from to i is RATE (2 ,3) ALFA (2,2) . 

Both LCF5PE and LCfSPRpt will be discussed in conjuction 
with generalized Srlangian service distributions. 



Example 2.26 

Suppose that the queuing discipline at PP1 is PS. 

Then, the value of DIVRAT must be changed by replacing the 

first term (i»hich is RATE(2,1); see Example 2.20) to 
(1/3) [ 2xRATE (1 , 1 ) + RATE(2,1) ]. Similarly, since the rate 
of transition from to i now depends upon the full 

complement of jobs at PP1 just prior to the transition, this 

rate must be changed to RATE (2,1) /2. 

4.5. Gene ral ization of Servic e Dis tri bu tions 



Example 2.27 

Suppose that the situation is the same as that in 
Example 2.20 except that jobs at PP2 have nenexponential 
service distributions. More specifically suppose that type- 
one jobs have a service distribution which can be modelled 
as three exponential stages and that type-two jobs have one 
which can be modelled as two exponential stages. Let 
RATE(i,2,k) he the rate parameter for the k-th stage of the 
distribution for type-i jobs (k = 1,2,3 if i = 1 and k = 1,2 
if i = 2) and let B(i,2,k) be the probability of a type-i 
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job routing to the cpu immediately after completing stage k 

service at PE2 (k = 1,2 if i =1 and k = 1 if i = 2) . Expand 

the ISTATE vector by one component on the right to indicate 

the stage of service for the job in service at PP2. Suppose 

that in state i, the job in service at PP2 is in its second 

stage of service. Then, for state i ISTATE = 

{1 , 1 , 2, 2 , 1 ,2 ,2 ,3, 3 , 2} and the second term in the calculation 

of DIVRAT becomes EATE(2,2,2). In the generation of the 

balance eguaticn the ISTATE vectors for j , j and j must 

2 3 a 

have the additional component, with value 2, added on the 

right. Since the job in service must have passed through 
stage one, (i) must be changed to: 

(i) Ercm state j with ISTATE = 

1 

{1 , 1,2,2, 1,2, 2,3,3, 1} by a type-two job at ?P2 

completing service and preceding to the second 

stage cf service at PP2. Sate of transition from 

j^ tc i is RATE(2,2,1)[ 1-B(2,2,1) ]. 

If, on the ether hand, the job in service at PP2 in 
state i is ir its first stage of service, then ISTATE = 
{1,1,2,2,1,2,2,3,3 ,1} for state i and RATE(2,2,1) is the 
second term in the calculation of DIVRAT. Again (ii) , (iii) 
and (iv) are as in Example 2.20 except that the ISTATE 
vectors have an addiricnal component on the right side, this 
time containing a 1. However, in this case it is possible 
to transition into state i by having a job complete service 
at PP2 and route to the CPLF. In fact, since type-two jobs 
at PP2 can complete service with either of two stages, there 
are two different states from which this can occur. 

Specif ically , (i) can be replaced by; 

(i) From state j with ISTATE = 

1 

{1 , 1,2 ,2, 1,2, 2,3 , 4,2} by a type-two job at ?P2 
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comfleting service and preceding to the CPQ. Rate 
of transition from tc i is RATE(2,2,2). 

(i') Frcm state j with ISTATE = 

1 1 

{1 , 1 /2 ,2, 1 ,2, 2,3 , 4, 1} by a type-two job at PP2 

completing service and preceding to the CPU. Rate 

of transition from j to i is 

1 ' 

RATE (2,2, 1) B (2,2, 1) . 



Example 2.28 

Suppose that the situation is as described in the 
first paragraph of Example 2.27 except that PP2 has LCESPR 
queuing discipline. Then as discussed in subsection 2.4, 
to the ISTATE vector representation of each state must be 
added N = 7 components to represent the stage of service 
each job was in when it lasr received service (or is 
currently in if it is receiving service). Suppose that the 
type-two job enqueued at PP2 was in its second stage and the 
type-one job in its third stage. Then, for state i, ISTATE 
= ( 1 , 1 , 2, 2 , 1, 2, 2 ,3 , 3, 1 , 1 , 1 , 2 ,3, 2, 1} where the ones for jobs 
not at PP2 are default values. DIVRAT is as described in 
the first paragraph of Example 2.27. The terms of the 
balance equation are determined by considering transitions 
into state i. 

(i) Frcm state j with ISTATE = 

1 

{1,1, 2, 2, 1,2, 2, 3, 4, 1,1, 1,2, 3, 2, 2} by a type— two 

job at EP2 completing service and preceding to the 

CPU. Rate of transition from j tc i is 

1 

RATE (2,2,2) . 
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(i') Ftcm state j with ISTATE = 

1 » 



{ 1 , 1 , 2 , 



1 o 



by a type-two 



job at EE2 completing service and 
CPU. Bate of transition from j 



proceding to the 
to i is 



HATE (2,2, 1) B (2,2, 1) . 

(ii) From state j with ISTATE = 

2 

{1, 1,2,2,^, 1,.^, 4, 3, 1, 1, 1,1, 2, 3, 2} by a t ype— t wo 



job at PP1 ccmpleting service and proceding to the 

CPU. Rate of transition from j tc i is 

2 

RATE (2,1) . 



(iii) From state j with ISTATE = 

3 

{1, 1, 2, z, 1,2, 2, 3, 3, 1,1, 1,2, 3, 1,1} by a type— two 



job at EP2 completing service and proceding to the 
second stage cf service at PP2. Rate of 
transition from j^ to i is 

RATE(2,2,1)[ 1-B(2,2,1) ]. 

(iv) From state j with ISTATE = 

4 

(1, z, 2, 1, 2, z, 1,2, 3, 1,1, 2, 3, 2, 1,1} by a type— one 

job at the CPU completing service and proceding tc 

PP1. Hate of transition from j to i is 

4 

RATE (1,3) ALFA (1,1). 

Note that a transition such as that described in (iii) cf 
Example 2.20 is not possible since any job arriving at PP2 
would immediately enter the first stage of service there. 



140 



Example 2.29 



Suppose that the situation is as in the first 
paragraph of Example 2.27 except that the queuing 
discipline at PP2 is LCFSPHpt. Then, N + 1 = 8 components 
must be added to each ISTATE vector. The first seven of 
these represent the largest stage of service attained at PP2 
by the various jobs since last departing from PP2. (Again 
one is used as a default value.) The last component is used 
to indicate the current stage of the job in service at PP2. 
Suppose that the largest stage attained by the type-twc job 
enqueued at PP2 was two and by the type-one job enqueued 
there was three. Suppose that the job in service at PP2 had 
attained the second stage but was interrupted there and is 
currently in the first stage of service. Then, for state i 
ISTATE = 11,1,2,2,1,2,2,3,3,1,1,1,2,3,2,1,1}. DIVRAT is as 
described in the second paragraph of Example 2.27. The 
terms of the balance equation (compare with Example 2.28) 
are obtained by considering transitions into state i: 

(i) Frcm state j^ with ISTATE = 

{1,1, 2, ^,1,2, 2, 3, 4, 1,1, 1,2, 3, ^,2,1} by a ty pe — two 



job at EE2 completing service and preceding to the 

CPU. Sate of transition from j tc i is 

1 

SATE (2,2,1) B (2,2, 1) . 



(i‘) Ercm 
{ 1 / 1 , 2 , 2 , 



state j with ISTATE = 

V 

1,2,2,3,4,1, 1, 1,2,3,2, 1, 1} 



by a type-two 



job at EP2 completing service and preceding to the 

CPU. Rate of transition from j to i is 

1 ' 



HATE (2,2,1) B(2,2, 1) . 
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(i”) Fxcia state j 



with ISTATE 



1» 

{ 1 , 1, 2, i , 1, 2, 2, 3, 4, 1 , 1 , 1 , 2 , 3, 2, 2, 2} by a type-two 

job at EE2 completing service and preceding to the 

CPU. Rate of transition from j to i is 

1” 

RATE (2,2,2) . 

(ii) Ficm state j with ISTATE = 

2 

{1,1, ^,2, 2, 1,2, 4, 3, 1,1, 1,1, 2, 3, 2,1} by a type— two 

job at EE1 completing service and preceding to the 

CPU. Rate of transition from j to i is 

2 

RATF (2,1) . 

(iii) From state j with ISTATE = 

4 

{ 1 , 2, 2, 1, 2, 2, 1 , 2, 3, 1 , 1 , 2, 3 , 2, 1 , 1 , 1} by a type-one 

job at the CPU completing service and preceding tc 

PP1. Rate of transition from j to i is 

4 

RATE (1,3) ALFA (^, 1) . 



4.6. liti te C apacitie s and Block ing 



Example 2.30 

Suppose that the situation is as in Example 2.20 
except that EP2 has a capacity of three jobs. Then to the 
ISIATE vector for each state add one additional component 
which takes on the value zero if the CPU is not blocked and 
rwo if it is blocked (by a job which has completed service 
at the CPU but found PP2 ar capacity when it tried to route 
there) . Finally suppose that in state i the CPU is not 
blocked sc that ISTATE = { 1 , 1 , 2, 2, 1 , 2, 2,3 , 3 , 0} . DIVHAI has 
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exactly that value specified in Example 2.20. (If the CPU 

were blocked, DIVRAT would be RATE(2,1) + RATE(2,2).) 

Turning new to the balance equation, the last three terms as 

specified in Example 2.20 are still valid with the addition 

of a zero-valued component to the right end cf the ISTATE 

vector for states 1 , j , and j . The capacity of PP2 makes 

2 3 4 

state j impossible. However, if the last enqueued jot at 
1 ' 

PP2 in state were instead blocking the CPU, a transition 

of the type described in (i) of Example 2.20 would cause 

simultaneous movement of this job to PP2 and result in state 

i. That is, (i) can be replaced by: 

(i) Ercm state j with ISTATE = 

1 

£1 , 1,2, 1,2, 2, 2, 3, 3, 2} by a type-two job at PP2 

completing service and preceding to the CPU and 
simultaneous movement of a type-two job from the 
CPU to PP2, unblocking the CPU. Rate of 
transition from j to i is RATE (2,2). 



4.7. Bulki ng 
Example 2.31 

Suppose that the situation is as Example 2.20 except 
that type-cne jobs destined for PP1 bulk at the CPU and 
depart only in pairs. Then, to the ISTATE vector for each 
state add one additional component which indicates the 
number of type-one jobs in the bulking queue at CPU. 

Clearly, fox state i, ISTATE = { 1, 1 , 2, 2, 1 , 2 , 2,3, 3, 0} . 

DIVRAT and the first three terms of the balance eguaticn are 
(with the cbvious change to the appropriate ISTATE vectors) 
as indicated in Example 2.20. In fact, even (iv) is correct 
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if the ISIATi vector for j is changed to 

4 

{2# 2, 1, 2,2, 1, 1 , 1, 3, 1} and the wording is changed slightly to 
indicate that two type-one jobs proceed to PP1. 

5. PROPEBTIES OF THE SYSTEM OF BALANCE EQUATIONS 

In this chapter concepts and techniques designed to aid 
in the solution of the variety of queuing network problems 
have been introduced. Many of these problems do not satisfy 
local balance and therefore are generally not known to have 
simple closed-form solutions, such as the product-form 
solutions. More specifically, this chapter has been 
concerned with how to conveniently specify, arrange and 
store the states of such problems and how to generate tne 
balance equations. Solution of these balance equations is 
the subject of the next chapter. 

The approach has been to introduce the concepts and 
techniques with a view toward eventual implementation on 
high-speed digital computers. The reason is simply that the 
size of the problems encountered in modeling even simple 
systems is very large. For example, the case considered in 
Example 2.20, two types of jobs with three jobs of one type 
and four of the other circulating in a central-server model 
consisting of a FCFS single-server CPU and two FCFS single- 
server PP*s, yields 1260 states and 3920 nonzero entries in 
the transition matrix. Increasing the number of PP's to 
four increases the state space to 11,550 and the number of 
nonzero entries to 58,800. 

Some of t-he generalizations discussed in this chapter 
will decrease the state space and others will increase it. 
For example, giving a PS queuing discipline or multiple 
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servers tc a queue tends to decrease the state space. On 
the other hand, the state space will be increased by 
introduction of mere job types (even though the total number 
of jobs remains unchanged) or by any generalization which 
increases the size of the I3TATE vector, for example, 
blocking, bulking, most preemptive queuing disciplines and 
generalized Erlangian service distributions. 

Aside from potentially massive size, an important 
feature of these problems is the sparseness of the 
transition matrix. This is seen in the examples in section 
4 where the naximum number of nonzero entries in each row is 
seen to be around four or five no matter what the number of 
states is. Two measures of sparseness are the average 
number of nerzero entries per row of the matrix (3920/1260 = 
3.1 and 58,800/11,550 = 5.1 for the examples above) and the 

2 

proporticn cf nonzero entries in the matrix (3920/(1260) = 

2 

.0025 and 58,800/(11,550) = .00044 for the examples 

above) . 

In the opening section of this chapter it was hinted 
that the techniques introduced here could be applied tc a 
more general class of networks than the central-server model 
for which they were introduced. Doing so will increase the 
sparseness measures (i.e., increase the number of terms in 
the balance equations) without affecting the size of the 
state space. The more specific setting of the central- 
server models was used here to simplify the presentation and 
because cf the applicability of the central-server models to 
modelling cf computer systems. 

In addition, some of the other properties of the 
central-server models allow analysis which may not be 
possible in ncre complex models. For example, a central- 
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server model with two job types and at least two PP's is 
guaranteed to be irreducible. If, in addition, all service 
distribu tions are exponential, the resulting Markov chain 
can be shewn to be periodic of order two. These two facts 
are proved in Appendix C. 

The technigues discussed in this chapter have been used 
in programming the key example and two generalizations for 
use on a high-speed digital computer. The program for the 
key example is given in Appendix D. One generalization 
gives the CPD a PS queuing discipline. This program is 
given in Appendix E. The other generalization retains the 
FCES discipline at the CPU but has the capability to make 
the CPU a multiserver processor. In addition, one of the 
PP's in this model has an IS discipline and can accept only 
one type of job. This program is given in Appendix F. All 
three models are discussed in Chapter IV. 

The fact that these three models are central-server 
models with exponential service times guarantees that they 
are irreducible and periodic of order two. This periodicity 
is used to advantage in discussing the convergence 
properties of the solution methods presented in the next 
chapter. 
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III. SOLUTION OF THE MLANCE EQUA TI ONS 



The goal of this chapter is to present a variery of 
procedures suitable for solution of the balance equations 
which result from application of the concepts and techniques 
discussed in the preceding chapter. In view of the 
properties of these systems of equations (see section 5 of 
Chapter II) , the main concern is determination of a 
procedure which is guaranteed to provide a solution for a 
cyclic model. In addition, a procedure which preserves the 
sparse nature of the system will allow solution of larger 
problems . 

In the first section of this chapter the choice of a 
solution method is discussed in the light of these 
consider ations . The resulting choice is an iterative 
method. This section also serves as an introduction to the 
analysis of various solution methods by introducing the 
problem as an eigenvalue/eigenvector problem. The second 
section introduces the notation used in the remainder of the 
chapter and reviews pertinent matrix terminology and 
theorems. The third section examines the eigenstructure of 
the matrices resulting from some of the central-server 
models discussed in the preceding chapter. Finally, section 
4 is devoted to various iterative solution methods and 
conditions under which they are guaranteed to converge. 
Subsection 4.4 indicates the areas in which further research 
maj be useful in accelerating the convergence of the method 
ultimately chosen as preferred for the problems of interest 
to the author. 
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1. CHOICE 01 A SOLUTION METHOD 



The system of balance e^guations discussed in the 
preceding chapter is a system of homogeneous linear 
equations. This fact is most easily seen to be true fiom 
equation, (2.3) which is repeated here: 

(3.1) P = AP 

and which can be rewritten as: 

(3.2) (I - A) P = i 

In these two equations A and I are square n x n matrices 
(where n is the number of states) and P and i are column 
vectors of length n; I is an identity matrix and all 
components of _C are zero. The nonzero elements of A are the 
numners stored in the array COEF as discussed in section 4 
of the preceding chapter. 

(Since tie primary references in this chapter are books 
on numerical analysis, the numerical analysts' emphasis on 
column vectors, rather than the probabilists ' emphasis on 
row vectors, is adhered to hare. As will be seen in section 
3, our expression of the problem is the transpose of that 
normally adopted by the probabalists. Hence, with due 
apology tc these who may be confused, we assure the reader 
that we are investigating the forward equations (see Feller 
[32], Chapter XIV, section 7) and not the backward equations 
as it may at first appear.) 

As a system cf homogeneous linear equations, (3.1) must 
have either an infinite number of solutions or no solution 
at all. Of course, the goal here is to determine a solution 
for (3.1) which satisfies two other conditions: 
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(3.3) 



1 



1 ? = 



where 1. is the row vector of length n all of whose 
components are ones, and 



(3.4) 



P > 0 



Equation (3.3) is called a normalizing condition. 

Two cfticns are open for attacking this problem. First, 
(3.2) and (3.3) could be concatenated and the resulting 
overdeterained system of linear equations could be solved. 
Obtaining a solution (hopefully), satisfaction of (3.4) 
could be tested. Second, (3.2) could be viewed as an 
eigenvector problem and an attempt could be made to find a 
right eigenvector of A, corresponding to the eigenvalue 1, 
which also satisfies (3.4). Normalizing (i.e., dividing the 
eigenvector by the sum of its components) would yield the 
desired result. Extensive literature provides a large 
variety of scluticn methods for both the linear equation 
problem and the eigenvalue/eigenvector problem. For 
example, see Householder [53] and Wilkinson [111]. Because 
of the structure of the matrix A in (3.1) , the 
eigenvalue/eigenvector approach was selected. 



Whether the linear equation problem or the 
eigenvalue/eigenvector problem is chosen, the solution 
techniques fall into three categories: direct, iterative, 
and semi-iterative. An iterative method of solving the 
problem was chosen. The reason for this choice was 
three-fold. First, the iterative methods are, in general, 
easier to pregram. Second, there is not as great a problem 
with -reund-eff as there is with the direct methods. (See 
Dorn and McCracken [29].) Finally, the iterative methods, 
combined with sparse matrix storage techniques, generally 
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require less storage to solve a given problem than do the 
the direct acd semi-iterative methods. Many of these latter 
methods use tow and column operations on the coefficient 
matrix. The result is that many of the elements which were 
originally zero are replaced by nonzero elements and 
therefore require storage. This phenomenon is referred to 
as "fill" in the literature. In addition, most direct and 
semi- iterative methods require storage of the coefficient 
matrix in a form which is less efficient than that discussed 
in secticn 4 cf the preceding chapter. 

On the ether hand, the iterative methods generally 
require longer running times. This disadvantage is somewhat 
diminished by the fact that the best direct methods take 
time to search and rearrange the rows and/or columns of the 
coefficient matrix in an effort to reduce fill and rhe fact 
that space, and not time, is the constraint in many large 
problems . 

For a comprehensive treatment of iterative and 
semi-iterative methods the reader is referred to Varga [106] 
or Young [114]. 



2. MATRIX PEEIIMINARIES 



The nctaticn and some of the elementary 
matrix-theoretical results used in succeeding sections of 
this chapter are discussed in this section. In the first 
subsection much of the notation used in this chapter is 
defined. The second subsection is devoted to permutation, 
irreducible and cyclic matrices and the final subsection, to 
eigenvalues and eigenvectors. 
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2.1. Nota tio n 

In general, upper case letters at the beginning of 
the alphabet will denote matrices, upper case letters at the 
end of the alphabet will denote vectors, and lower case 
letters .will denote scalars and functions. If A is a 
matrix, the element in the i-th row and the j-th column of A 
will be denoted a . Similarly, the i-th component of 

i j 

vector X will be denoted x . The statement "A is an m x n 

i 

matrix” means that A is a rectangular matrix with m rows and 

n cclumns. The statement "A is a matrix of order n” means 
that A is a square matrix with n rows (and columns) . 

Although the interest here lies primarily in the area of 
square matrices having real elements, many of the results 
are also valid for matrices having complex elements, and at 
times rectangular matrices will be dealt with. 

A matrix cr vector is described as real, positive, 
ncnnegative, complex, etc. according to whether its 
elements are real, positive, nonnegative, complex, etc. For 

T * 

any matrix A, A denotes the transpose of A, A denotes the 

complex ccEjugate cf A, and, if A is square and nonsingular, 

-1 

A is the inverse of A. If X is a vector, t a scalar and f 

X * * -1 * -1 

a functicn, then X , X , t , t , f and f are similarly 

defined. Whether a vector is a row vector or a column 
vector will usually be clear from the context. 

The following special notation is used throughout 
this chapter; 

(i) I = the identity matrix (order to be inferred 
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frcm context) . 



(ii) I = the identity matrix of order n. 

n 

(iii) ^ = the matrix of order n all of whose 

n 

elements are zero. 

\ 

(iv) = a matrix (in general, rectangular) all of 
whose elements are zero. 

(V) i = a null vector = a vector all of whose 
components are zero. 

(vi) N = {1,2,,..,n} = the set consisting of the 
n 

first n positive integers. 

Ihe first part of Definition 3.1 follows that cf 
flalmos [ 48 ]. 

het T be any set. A parti tion of T 

is a collection cf nonempty subsets of T, {S } , such that 

i i€J 

each element cf T is in exactly one S . If J = N fcr some 

i n 

positive integer n and T is a set of real numbers, then 

{S } is called an ordered partition if each element of S 
i icj i 

is smaller than each element of S for i = 1,2,... n-1. 

i+1 

If A is an m X n matrix, consideration may be given 
to a partition of A as: 
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A 


... A 


1 1 


12 


It 


A 


A 


... A 


21 


22 


2t 


A 


A 


... A 


31 


32 


3t 


A 


A 


• • • A 


si 


s2 


St- 



where the A are (rectangular) matrices called submatrices 

i j 

cf A. Fcrmally, this situation may be described by letting 



s t 

(S } be an ordered partition cf N and {T } be an 
k k=1 m k k=1 



ordered partition cf N . Then, g € S and r 6 T iff a is 

n k h gr 

one of the elements of the submatrix A . In the cases 

kh 

considered here, A will be a square matrix and the 



partitions {S } and {T } will be identical. As a result, 
k h 

the submatrices on the main diagonal will be square. 



In a similar way partitioning a vector into 
subvectors will be considered. If a matrix A is under 
consideration and has been partitioned as in (3.5), and if a 
ITT T 

vector X =(X,X,..., X)is introduced, then X has n 
12 t 

components ard can be assumed to have been partitioned in 



such a manner that the product A X 

kh h 

is, the product AX is given by; 



is well 



defined. 



That 
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I 











A X 


+ . . . + 


A X n 






11 1 




12 2 




It t 






A X 


+ 


A X 


+ . . . + 


A X 






21 1 




22 2 




2t t 


(3.6) 


AX = 


A X 


+ 


A X 


^ . + 


A X 




31 1 




32 2 




3t t 






A X 




A X 


+ . . . + 


A X 



*- si 1 s2 2 St t-» 



For a row vector Y = (Y , Y , ... , Y ) the product 

12 s 

similarly defined. 



YA is 



2.2. Special Tl£e s of Mat ric es 

Fcr background reading on permutation matrices, the 
reader is referred to Birkhoff and MacLane [12]. For 
background reading on irreducible matrices, see Varga [106] 
or Young [11^]. For background reading on cyclic 
irreducible matrices, see Feller [31] or Gaver and Thompson 
[44]. Although Varga has an excellent discussion of cyclic 
irreducible matrices, the development here is more closely 
related tc the discussion of periodic states of a Markov 
chain as found in these other references. 

Definition 3^2 A perm u tatio n matri x is a matrix 
each of whose elements is either a zero or a one, and each 
row of which, and each column of which, contains exactly one 
nonzero element. 

Several facts concerning permutation matrices are 
now listed without proof. Let M be a permutation matrix. 

(i) M is square and nonsingular. 

- 1 T 

(ii) M = M is a permutation matrix. 

(iii) Premultiplication of a column vector, or 

postmultiplication of a row vector, by M has tae 
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1 



effect of rearranging the components of the 
vector . 

(iv) Premultiplication (postmultiplication) of a 
matrix by M has the effect of rearranging the 
order of its rows (columns) . 

(V) Premultiplication of a square matrix. A, by M 

-1 

and. postmultiplication of the result by H has 

the effect of rearranging the rows and columns of 
A in the same way. If the i-th row and i-th 
column of A correspond to a state (e.g., if A is 
a transition marrix; or if A is a coefficient 
matrix as described in section 4 of the preceding 
chapter) , then the effect is a renumbering of the 
states . 

(vi) The product of M and another permutation 
matrix results in a permutation matrix. 

is ^ permutation matrix of 

order n, the per mu tat ion funct io n associa ted wi th M is a 

function f from N to N such that f (i) = j iff m = 1. 

n n i j 

Seme properties of permutation functions are: 

(i) If f is the permutation function associated 
with H, then f is a well defined, one-to-one, onto 
function . 

(ii) If f is the permutation function associated 

-1 

with M, then f is the permutation function 

-1 

associated with M 

(iii) Properties (iii) , (iv) and (v) of the 
permutation matrix M can be expressed more 
explicitly in terms of the permutation function f 
associated with M. For example, if A is a matrix 
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-1 

of order n, and 3 = MAM , then b = a 

ij f(i)f(j) 



for all i and j in N . 

n 



Cef inition 3_^U A matrix A of order n is ir re ducible 

if n = 1 or if n > 1 and, given any partition {S, T} of N , 

n 

there is an i in S and a j in T such that a #0. 

ij 



iheorera Each of the following two conditions is 

equivalent tc irre ducibilit y of a matrix A of order n: 

(a) There does not exist a permutation matrix M such 

-1 

that MAM has the form 



(3.7) 



MAM 




0 n 



I A 

21 



A 

2 2 -* 



where A and A are square matrices. 

1 1 22 

(b) Either n = 1 or n > 1 and, given any two numbers i 

and j in K , either a ^ 0 ov there exist i , i , ... , i 
n ij 12s 

all in N such that 
n 



a a 
ii i i 
1 1 2 



a # 0 

i j 

s 



Proof of Theorem 3.1 is found in Young [114]. 

In terms of srochastic modelling terminology (see 
Feller [31] cr Gaver and Thompson [44]) part (b) of Theorem 
3.1 says that the transition matrix (and the system as well) 
is irreducible iff given any two states i and j, it is 
possible tc get from i to j in a finite number of 
transitions. 
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If A is a matrix of order n and if for 

any two elements, i and j, of N there exists an integer m 

n 

(possibly dependent upon i and j) such that the element in 

m 

the i-th row and j-th column of A is nonzero, then A is 

irreducible. Furthermore, if A is nonnegative, the reverse 
implication is also true. 

Proof: Both parts are a direct consequence of (b) 

of Theore ui 3.1. 



3^5 An irreducible matrix A of order n 
is said to be cycl ic w ith period k, or k-c yclic , (where k > 



k 

1 is an intecer) if there is a partition, {S } , of N 

i i=1 n 



such that if a * 0, then either 

ij 

(a) i is in S and j is in S ; or 

1 k 

(b) i is in S and j is in S for some m # 1 in 

m m-1 



and if there is no decomposition of U into k' > k subsets 

n 

having rhis same property. 



1^3 If A is a k-cyclic matrix of order n, 
then there is a permutation matrix M such that 
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(3.8) 



MAM 



-1 





ri2> 

n 

1 






• • • ^ 


s ■ 




A 

2 


n 

2 


0 


• mm 0 


0 






A 

3 


0 

n 

3 


• mm 0 


0 




L 




0 


... A 

k 


0 

n 

kJ 



where n + n + ... + 11 = n and all of the nonzero elenants 

1 2 k 

of A are contained in the submatrices A , A . 

1 k 



froof: Let {S } be the partition of N provided 

i i=1 n 

for in Definition 3.5. Let n be the number of elements of 

i 

S for i = 1,2,...,k. Put the n elements of N into a 
i n 

vector V cf length n such that the n elements of 5 are 

1 1 

listed first, then the n elements of S , and so forth. 

2 2 

Define a aatrix M by letting m = 1 if v = j and m =0 

ij i ij 

k 

otherwise. Since {S } is a partition of N , M is a 

i i=1 n 

permutation matrix. Let f be the permutation function 



associated with M. Note that f(i) = j iff m =1 iff v = 

ij i 



j. Thus, f (i) = V for i = 1,2,. ..,n. Letting B = MAf? 

i 



-1 
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I 



i 

i 



I 

I 



i 

I 



« 



and using property (iii) of permutation functions, b = 

i 3 



a . Suppose b o and 1 < i < n . Then, f(i) = v 

f(i)f(j) ij 1 i 

is in S . By (a) cf Definition 3.5, f(j) = v is in S so 
1 j k 

that n n < j < n. That is, b must be in A . The 
k i j 1 

remainder of the theorem follows through similar use of (b) 
of Definiticr 3.5. 



The matrix on the right side of (3.8) illustrates 

what will hereafter be called the first canonical form for a 

k-cyclic matrix. Companion to this is a second canonical 

form which has all of its nonzero elements in the 

sutmatrices immediately above the main diagonal matrices and 

in the submatrix in the lower left hand corner. This second 

canonical form will not be discussed in this thesis. Note 

that these twc forms coincide if k = 2 . Hereafter, the 

statement *’A is a k-cyclic matrix of order n in first 

canonical form” will imply that A is an irreducible matrix 

which satisfies (3.8) with H = I . 

n 



In the case that the period of the irreducible 
matrix A is two, a third canonical form is also of interest. 
The statement "A is a cyclic irreducible matrix of order n 
in third canonical form" will imply that the period o^ A is 
two and there is an integer k (2 < k < n) such that 
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n 

1 


A 

12 


P 


P 


• • • 0 


0 1 




A 

2 1 


P 

n 

2 


A 

23 


P 


... 


P 


(3.9) A = 


0 


A 

32 


P 

n 

3 


A 

34 


• • • 


P 


I 


fi 

. 


P 


P 


P 


... A 

k,k-1 


P 

1. 


where n + n + 
1 2 


• • • 


+ n 

k 


= n . 


An 


example of a 


Markov 



transition matrix exhibiting this form is provided in Feller 

[31], chapter XV, section 2, example (e) where the Ehrenfest 
model of diffusion is discussed. In this example k = n. If 
k = 2, the third canonical form coincides with the first and 
second. 



2.3. Ei ge nval ues and Eigenv ecto rs 

For background reading on the subject of this 
subsection, the reader is referred to Wilkinson [111]. 
Throughout this subsection A is a matrix of order n. 

The complex number t is an 

eigen val ue of A if the matrix (A - tl ) is singular. The 

n 

polynomial f (t) = det (A - tl ) is called the chara cteristic 

n ' 

poly n omi al of A. If t is an eigenvalue of A, then any 
nonnull vector X satisfying the equation: 

(3.10) XA = tX (or equivalently, X (A - tl ) = i) 

n 



160 



is a left eigen vec to r of A corresp onding to t, and any 
nonnull eigenvector Y satisfying: 

(3.11) AY = tY (or equivalently, (A - tl )Y = J) 

n 

is a rig ht eige nve ctor of A co rrespond ing to t. 

The following are properties of eigenvalues and 
eigenvectors. Although in most cases similar statements 
hold for left eigenvectors, the properties and theorems of 
this subsection will be stated in terms of right 
eigenvectors only. 

(i) The eigenvalues of A are the roots of f(t) =0 
where f (t) is the characteristic polynomial of A. 
Hence, there are at most n distinct eigenvalues. 

(ii) Corresponding to each eigenvalue of A there 
is at least one right eigenvector. 

(iii) Any linear combination of right eigenvectors 
of A corresponding to a given eigenvalue is also 
an eigenvector of A corresponding to that 
eigenvalue. In particular, any (nonzero) scalar 
multiple of a right eigenvector is also a right 
eigenvector. 

(iv) If t , t , ..., t are distinct eigenvalues 

1 2 Jc 

of A and, for each i = 1,2,. ..,k, X is a right 

i 

eigenvector cf A corresponding to t , then the 

i 

vectors X , X , X are linearly independent. 

1 2 k 

(V) If t is an eigenvalue of A and X is a rignt 

eigenvector of A corresponding to t, then t is an 

IT T 

eigenvalue of A , X is a left eigenvector of A 

♦ * 

corresponding to t, t is an eigenvalue of A , and 
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♦ # 

X is a right eigenvector of A corresponding to 

* 

t . 

(vi) If A is real/ then t is an eigenvalue of A 
♦ 

iff t is an eigenvalue of A. 

T 

(vii) If A is real and symmetric (i.e., A = A) , 
then all of the eigenvalues of A are real. 



theorem 3_j_4 Let t be an eigenvalue cf A and let X 
be a right eigenvector of A corresponding tc t. Then for 
any nonsingular matrix B of order n, t is an eigenvalue of 

. . . -1 
BAB and BX is a right eigenvector of BAB corresponding 

to t . 



Since X is a right eigenvector of A 
corresponding to t, tx = AX. Hence, tBX = B(tX) = B(AX) = 

-1 

BA <I X) = (BAB ) (EX) . 
n 

Of particular interest in Theorem 3.4 is the case in 
which B is a permutation matrix. In that case (see property 

-1 

(iii) of permutation matrices) , the eigenvectors of BAB 

are identical to those of A except that the order of the 
components are rearranged in a manner which can be specified 
in terms cf the permutation function associated with B. In 
view of property (v) of permutation matrices. Theorem 3-4 
indicates the effect of renumbering the states on the 
eigenvalues and eigenvectors of the transition or rate 
matrix of a Markov chain. 

The following theorem is stated without proof since 
the proof is a simple verification. 
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Iheo retn 3.5 Let A be a k-cyclic matrix of order n 
in first canonical form. Then t is an eigenvalue of A and X 

T 

is a right eigenvector of A corresponding tc t, where X = 

IT I 

(X.X/.../X)/ iff 
1 2 k 



AX if m = 1 

1 k 

(3.12) tX = 

m 

AX ifm=2,3,...,k 

m m- 1 



Returning to equation ( 3 . 2 ), it is seen that the 
eigenvalue/eigenvector problem alluded to in section 1 is 
determination of a right eigenvector of the coefficient 
matrix A corresponding to the eigenvalue 1 . As indicated in 
the next section, Perrcn-Frobenius theory provides 
guarantees that 1 is an eigenvalue and that a right 
eigenvector of A corresponding to the eigenvalue 1 exists 
which also satisfies (3.4), 



3. EIGENSIRDCIURE OF THE CENTRAL SERVER MODELS 



Before considering the solution methods to be discussed 
in the next section, it is helpful to know something about 
the eigenvalues and eigenvectors of the coefficient matrix A 
of eguaticn (3.1). Recall that the objective is to find a 
right eigenvector cf A corresponding to an eigenvalue of one 
and satisfying both (3.3) and (3.4). The first question 
should be, ’'Is one an eigenvalue of A?" Then, "Is there a 
ncnnegative right eigenvector of A corresponding to one?" 

If so, property (iii) of eigenvalues and eigenvectors 
guarantees that it can be normalized (i.e., divided by the 
sum of its components) and, thereby, one can be found which 
satisfies (3.3) as well. Recalling the definition of F as 
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«>rlil|fe 




the steady-state probability distribution for the system, 
the next question should be, "Is this right eigenvector of A 
corresponding to one and satisfying (3,3) and (3.4) unique 
with respect to these properties?" 

Note that these three questions are exactly those which 
are answered by the standard theorems concerning existence 
of an in'variant distribution for an irreducible Markov chain 
with only persistent states. (See Feller [31], Chapter XV, 
sections 7 and 9.) Despite the fact that A is not a 
transition matrix (indeed, A is neither row nor column 
stochastic) , this approach can be taken. Defining an n x n 
matrix B whose i-th main diagonal element is the rate of 
transition from state i (given the system is in state i) for 
i = l,2,...,n and whose nondiagonal elements are all zeros, 
-1 

then C = BAR is column stochastic and (3.1) can be 

rewritten as; 

/ 

(3.13) Q = CQ 

where Q = BF . The Markov chain theory cited above provides 
the existence of a unique probability vector Q satisfying 

(3.13) if the appropriate hypotheses are satisfied. This 
result can then be translated into a result concerning 
(5.1). The reader is referred to Feller [32], Chapter XIV, 
section 7 on applications of Laplace transforms. If the 
limit is taken in Feller's (7.19) as the transformation 
variable approaches zero, the result is the transpose of 

(3.13) above. 

As an alternative to r/iis approach the results of 
Perron-Frcbenius theory will be used here. At the same time 
the eigenstr ucture of A and C will be investigated further. 
Our primary reference for this section is Seneta [99]. Other 
references are Varga [106] and Young [114]. 
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3.1. P€££on-F robenius Th eory 



The following theorem contains many of the important 
results of the Perron- Frobenius theory of finite nonnegative 
matrices. Its proof is found in Seneta [99] from which its 
contents were extracted. 

Ihe ore m 3 . 6 Suppose T is an n x n irreducible 
nonnegative matrix. Then there exists an eigenvalue r of T 
such that: 

(i) r is real and positive; 

(ii) there are strictly positive left and right 
eigenvectors of T corresponding to r; 

(iii) the eigenvectors of T corresponding to r are 
unique up to constant multiples; 

(iv) r is a simple root of the characteristic 
equation of T; 

(V) r > itl if t is any other eigenvalue of T; 

(vi) if I is k-cyclic, then T has exactly k 
eigenvalues t such that lt| = r and these k 

eigenvalues are the roots of the equation 
k k 

t - r = 0 ; 

(vii) if I is not cyclic, then r > 1 1 1 if t # r 
and t is an eigenvalue of T; 

(viii) if s_ is the sum of the elements of the 
i 

i-th row of T and c is the sum of the elements of 

i 

the i-th column of T (for i = 1,2,...,n), then 

(3.14) min s i r ^ max s and min c < r ^ max c 
ii ii ii ii 
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Ihe eigenvalue r referred to in Theorem 3.6 is 
called the Perron- Frobenius eigenvalue of T, and the right 
and left eigenvectors corr esponging to r are called 
Perron-Frcbenius eigenvectors. 

A quick review of the early part of section 4 of the 
preceding chapter reveals that A is a nonnegative matrix. 

As indicated in Appendix C, for many models this matrix is 
irreducible (though perhaps cyclic). Hence, Theorem 3.6 
applies in these cases. If the Perron-Frobenius eigenvalue 
of A were known to have value 1, the three questions posed 
at the begincing of this section would all be answered 
positively by (i) , (ii) and (iii) of Theorem 3.6. 

Consider now the matrix C defined above and 
appearing in (3.13). Since premultiplication of A by H and 

-1 

postmultiplication hy R has the effect of, respectively, 

multiplying each element in the i-th row by the rate of 
transition from state i and dividing each element of the 
i-th column by this same (positive) number, C is also 
irreducible (and cyclic). But C is column stochastic. So, 
part (viii) of Theorem 3.6 indicates that r = 1 is the 

Perron-Frobenius eigenvalue of C. By Theorem 3.4, A = 

-1 

R CR has the same eigenvalues as C. The conclusion is that 
the Perron-Frobenius eigenvalue of A is r = 1. Note further 
that part (v) of Theorem 3.6 provides assurance that the 
Perron-Frobenius eigenvalue is, in some sense, dominant 
among the eigenvalues of A. This fact will become important 
in the discussion of the solution methods in the next 
section. First, however, it is advantageous to examine more 
closely the eigenvalues and eigenvectors of A. 
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3.2. 



ure of a k-cy clic Matrix 



Throughout this subsection, A is a Jc-cyclic matrix 
of order n in first canonical form. Recall that if the 
matrix of interest is not in first canonical form, there is 
a renumbering of the states which will put it into first 
canonical form (Theorem 3.3), and doing so will not affect 
the eigenvalues and will only rearrange the components of 
the eigenvectors (Theorem 3.4). As a matter of notaticnal 
convenience the following definition is provided; 

XiZ ro tat ion fun cti on e is defined 

for any real number s as; 

(3.15) e (s) = exp {-2ni (s/k) } 

where i is the imaginary unit, t /-T 

The function e is called a rotation function because 
mulriplicaticn of any complex number by e(s) induces a 
rotation of that complex number through an angle of size 
2ns/k radians about the origin in the complex plane. Note 
that for any real numbers s and t and any integer m; 

t 

(i) [€(£)] = « (St) 

(ii) e (s) e (t) = e (s + t) 

(iii) e (nik) = e (0) = 1 

Turn now to the eigenstructure of A when A is in 

first canonical form. From (3.3), with M = I , note that 

n 

since each diagonal submatrix of A is square, the product 

A A is compatible as is the product A A for i = 2, 3, 

Ik i i-1 
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k. For i = 1, 2, .../ k, define a matrix B = 

i 

AA ...AAA ...A . Although the following three 

i i-1 1 k k-1 i + 1 

theorems are stated and proved for B = B # similar results 

k 

hold for each E . 

i 



lil If f is an eigenvalue of A and X is a 

T 

right eigenvector cf A corresponding to t where X = 



ITT k 

(X ,X ,...,X ), then w = t is an eigenvalue of 3 and X is 
12k k 



a right eigenvector of B corresponding to w. 



£roof: Using Theorem 3.5 and, in particular, 

equation (3. 12) ; 

k 

BX = A A ...A X = tA A ...A X = ... = t X = wX . 
k kk-1 Ik kk-1 21 k k 



Th eore m 3 . 8 If w # 0 is an eigenvalue of 3 and X^ 
is any right eigenvector of B corresponding to w, and if t 



is any complex k-th root of w (i.e., any solution of t = 



w) , then t is an eigenvalue cf A and X is a right 

T T T T 

eigenvector cf A corresponding to t for X = (X ,X ,...,X ) 

12 k 

where X = (1/t)A X and, for m = 2,3,...,k-1, X = 
11k m 

(1/t) A X 

m m-1 



il25f- By Theorem 3.5, all that remains is to show 
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that tX = A X . To that end use, respectively, the 
k k k-1 ^ 

del in iticns of X ,X /^aA/X and 3 and then the fact that 

k- 1 k-2 1 

X is a right eigenvector of B corresponding to w and the 
k 

k 

fact that t = u: 



AX = ' (1/t) A A X = (1/z) A A A X = . . . 

k k-1 k k-1 k-2 k k- 1 k-2 k-3 

k* 1 k-“ 1 1 

= (1/t) A A ...A X = (1/t) BX = (1/t) wX = tX 

k k-1 1 k k k k 



if f is any eigenvalue of A and X is 

I 

any right eigenvector of A corresponding to t, where X = 



IT I 

(X ,X ,...,X ), then for each s = 1,2,...,k-1, t = te (s) is 
12k s 

also an eigenvalue of A and Y is a right eigenvector cf A 

s 

corresponding to t where 

s 

ITT T 

Y = (e(-s)X ,e(-2s)X ,...,e(-ks)X ). 

s 1 2 k 



P roo f ; First consider the case in which t = 0. Let 

s te any element of N . Then t = te(s) = 0 is an 

k-1 s 

eigenvalue cf A. Furthermore, since X is an eigenvector and 

€ (z) #0 for all real z, Y is nonnull. A [e(-ks)X ] = A X 

s 1 k 1 k 

= i = t [e(-s)X ]. Similarly, for m = 1,2,...,k-1, 
s 1 

A [e(-ms)X ] = e(-ms)A X = i = t [e{-(m+1)s)X ]. By 

m+1 m m+lm s m+1 

Theorem 3.5, Y is a right eigenvector of A corresponding 
s 

to t =0. 
s 

k 

Now suppose t # 0 and let w = t . Again, 
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let s be an artitcary element of N . Note that 

k- 1 

k k k 

t = [te(s) 3 = t e (sk) = w. By Theorem 3.7, w is an 

s 



eigenvalue of B and X is a right eigenvector of B 



corresponding to w. Since t is one of the k-th roots of w, 

s 

t is an eigenvalue of A by Theorem 3.8. Using Theorem 3.8, 
s 

the fact that Y is a right eigenvector of A corresponding 



s 

to t is verified as follows. First, (Y ) = (1/t )A X = 

s s 1 s 1 k 

e(-s) (1/t)A X = e (-S) X . Proceding by induction on m (from 
1 k 1 

m = 2 to m = k - 1) , (Y ) = (1/t )A X 

s m s m m- 1 

(1/t) A [e(-(m-1)s)X 3 = e (-ms) X . The conclusion 

m m- 1 m 

follows since e(-ks) = 1. 



4. DETEESINATION OF DOMINANT EIGENV ALUE/EIGENVECTOR 

As noted at the end of subsection 2.3, the problem under 
consideration is the determination of a right eigenvector of 
the coefficient matrix A corresponding to the eigenvalue 1. 
The Perrcn-Frobenius theory discussed in that subsection not 
only led to a guarantee that one is an eigenvalue of A and 
that a ncnnegative right eigenvector of A corresponding to 1 
exists; it also guaranteed that one is a dominant eigenvalue 
(i.e., if t is an eigenvalue of A, then |t| < 1) and that 
the right eigenvector mentioned is positive and unique up to 
scalar multiplication. In addition, the exact form of all 
other eigenvalues of A having modulus one and of the right 
eigenvectors corresponding to these other dominant 
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eigenvalues were determined (see Theorem 3.9). 



This section discusses some simple iterative methods of 
determining the sought-after right eigenvector. Since many 
of the central-server models of interest are cyclic, the 
effect of applying these methods to a k-cyclic matrix are 
considered. For a more extensive account of the acyclic 
case, the reader is referred to Householder [53]. The 
methods discussed here closely parallel the stationary 
iterative methods for solution of a system of 
(nonhomogeneous) linear equation discussed by Young [114] in 
his third chapter. 

Once again, even though the discussion is confined to 
right eigenvectors, a parallel development is possible for 
left eigenvectors. 



4.1. The Power Meth od 



Ike well-kncwn power method , is a favorite tool for 
determining the steady-stare probability for finite ergodic 
Markov chains in beginning stochastic modelling courses. It 
will presently be shown that this method runs into 
difficulty whenever the matrix under consideration is 
cyclic. Nonetheless, it is an important method since each 
of the ether methods to be discussed is equivalent to the 
power method applied to a matrix which is derived from the 
matrix of interest. 

Let A be a matrix of order n. For the present, 
assume nc further knowledge concerning A or its 
eigenstr ucture. The power method is an iterative technique 

( 0 ) 

which generates, for some starting vector X , a sequence 
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1 






4 

4 



( 1 ) ( 2 ) 

of vectors X , X 




where for each i > 1 , 



(3.16) 



(i) 



-1 _(i-1) 



= b AX 



where b is a normalizing factor. Under appropriate 
i 

conditions, the normalizing factors b will converge to a 

i 

(i) 

dominant eigenvalue of A and the vectors X will converge 



to a right eigenvector of A corresponding to this dominant 

eigenvalue. Note that (3.16) can be written as a sequence 
of n equations as follows; For j = 1,2,...,n, 



(3.17) 



(i) -1 ^ (i-1) 

X = b > a X 
j i jk k 

k=1 



Keeping this formulation in mind, computational 
implementation of the power merhcd follows: 

The Power Method 

(X) Choose a starting vector X , a normalizing 

(row) vector Z, and a convergence criterion. Set 
i = 1. 

(i) (i-1) 

(ii) Calculate Y = AX . Computationally, 

this is accomplished in the following manner: For 

3 ~ 



(3. 18) 



(i) 



y . 

3 



n 

= Z a 
k=1 



X 

jk k 



(i-1) 
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1 



i 



1) (1) -1 (i) 

( 111 ) Let b = ZY and X = b Y . 
i i 

(iv) If i = 1 , or if i > 1 and the convergence 

criterion is not satisfied, increase i by 1 and go 

to (ii) . Otherwise, the method has converged; b 

i 

j . (i) . 

IS a dcminam: eigenvalue and X is a right 

eigenvector corresponding to b, . STOP. 

Typical normalizations used set the sum of the 

(i) 

components of X egual to one (all components of Z are 

(i) 

ones) or set the first component of X egual to one (the 

first component of Z is one, all others are zero; b = 

i 

(i) . , ... 

y^ ) . Typical convergence criteria include: the absolute 



difference between b and b is small, the percentage 

i i-1 

difference between b and b is small, the largest 

i i- 1 

absolute difference between corresponding components of X 



(i) 



(i- 1) 

and X is small, and the percentage change in the sum of 

the absolute differences between corresponding components of 

(i) (i-1) 

X and X is small. ("Small” means "less than e" for 

seme e > 0.) 



Turn now to the convergence properties of the power 

method. Suppose that there are m distinct eigenvalues of A, 

namely t , t , ..., t , numbered such that 
1 2 m 
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(3.19) 



It I > It I > > It I 

12 m 



Note that all cf tte right eigenvectors of A and all of the 
vectors generated in the power method are elements of the 

space of n-dimensional complex vectors, hereafter called C^. 



It can be shown that there is a basis of C , B = {H , H , 



. H }, such that each H is associated with exactly one 
n i 



eigenvalue cf A, and if B is the subset of B associated 

j 0 



with eigenvalue t for some j = 1,2,...,m, then each right 



eigenvector cf A corresponding to t is a linear combination 

j 

of the elements of B . (For those familiar with this 

j 

subject area, the elements of B are the columns of a matrix 

0 

-1 

H such that K AH is in Jordan canonical form. Others are 



referred to Househclder [53].) Consider a special case in 



which each element of B is an eigenvector of A. More 

0 



general situations exist. However, their consideration 



leads to the same conclusions, complicates the analysis, and 
adds very little in the way of insight into the subject 
matter of this thesis. 



n 

Since 3 is a basis for C and any starting vector 
0 

(0) n (0) 

X is in C , X can be uniquely expressed as a linear 



combination of the elements of B . Since each element of 3 

0 0 
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is an eigenvector of A, property (iii) of eigenvectors 
implies: 



(3.20) 




m 



7 
3 = 1 



c V 
3 j 



where V ' is a righr eigenvector cf A corresponding to t . 

3 3 

Note that V is a linear combination of the elements of B 
3 3 

for each j = Suppose now that a normalizing 

vector Z has teen chosen. For each s = 1,2,3,..., 



(3.21) 




s 



(TT b.) 

1 

i=l 



m 

-1 s 

■5 C t V 

3'=, ^ ^ ^ 



where b is as in step (iii) of the power method, 
i 



goal in 



Consider the case in which |t | > |t |. Then, the 

1 2 

applying the power method is to find the value of t 



and a right eigenvalue cf A corresponding to t^. If X is 

not orthogonal to all of the vectors in B , then c 9^ 0 and 

1 1 

the power method is guaranteed to converge to t^ and seme 
multiple cf the right eigenvector V^. To see how this 
happens in the case under consideration, rewrite (3.21) as 



175 



s m 

(s) s 

(3.22) X = [TT (t /b.)] Z (t /t ) c V 

"I 1 1 j j 

i=1 j=1 



= £ty(TT b.)]{c V + (t yt )%y 
1 1 11 2 12 2 
i=1 



+ ... + (t /t ) c V } 
ml mm 



Since It I > |t I for j = 2,3,...,m, (t /t ) goes to zero 
1 3 j 1 

as s increases. Hence, the sum approaches c V . To see 

1 1 

what happens to the factor preceding the sum on the right 
hand side cf (3.22), consider a value of s so large that the 
sum can he replaced by Then, replacing the factor 

(s) 

preceding the sum by w , (3.22) shews that X is 

s 

approximately egual to w c V . Followino the steps of the 

s 1 1 

(s+1) 

power metfcod, Y is approximately egual to w c AV = 

s 1 1 



w c t V and b is aoproximat ely egual to w c t ZV = t 
sill 3+1 S 1 1 1 1 



(£) 

since X had already been normalized. Thus, w 



s+1 



w (t /b ) = w approximately. So, b converges to t and 

s 1 s+1 s si 

(S) 

X converges to a multiple of V^. Also note from (3.19) 



and (3.22) that J t /t 1 is a measure of the rate at which 

2 1 
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this convergence takes place. That is, the smaller this 
quantity is, the mere rapidly convergence takes place. 

( 0 ) 

If X IS orthogonal to all of the vectors in 3 

1 

(i.e., c^ = 0), then it is seen from (3.21) that the power 

method cannet converge to t^ and a corresponding right 

eigenvector, though it might converge to one of the other 
eigenvalue/eigenvector pairs. 

New consider the case in which It 1 = It 1 > It | 

1 k k + 1 

for some 1 < k < m (where t = 0 if k = m) . This is the 

m + 1 

case if k is k-cyclic. Then, for large s, terms involving 

t , t , ..., t in (3.22) can be ignored. The 

k+1 k+2 m 

(approximate) result is (3.22) with the upper limit on the 

(s) 

sum changed from m to k. So, for each s, X xs a linear 

combination cf the right eigenvectors V , V , ..., V where 

12 k 

the coefficients are dependent upon s. In the case that A 
is k-cyclic. Theorem 3.9 gives some insight into the nature 
of this dependence. In particular, t /t = e (j) for j = 

j 1 

1,2,. ..,k. Sc, we get (approximately) : 
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(s) 

(3.23) X = w ? e(sj) c V 



(s) 

From this it is easy to see that (for s large) X is 

(s + k) 

approximately the same as X but probably different from 

(s+1) (s + 2) (s + k-1) 

X /X , , X . Even w does not converge, 

s 

(s) 

though |w I does. Note that even though the sequence X , 

C 

(S+k) (s + 2k) 






.. converges, rhe limit is not an 



eigenvector cf A, in general. 



Example 2 . 1 

Consider applying the power method to the following 
ncnnegative, 2-cyclic, irreducible matrix A: 



A 



*-2 



1/2 

0 



1 



j 



The following table lists the first four b *s and X s. 

i 

The normalizing vector is Z = (1,1); the starting vector is 
given in the table. 



i 

b 



(i) 

X 



0 



r 2-, 

L-1J 



1 2 

7/2 2/7 

r~ 1/7 t r 
L 8/7-» *--iJ 



3 4 

7/2 2/7 

r~1/^T r 2 t 
L 8/7-1 «— 1J 



178 



By Theorems 3.6 and 3.9, A has two distinct eigenvalues; 
both are real and one is the negative of the other. The 

positive eigenvalue is between 1/2 and 2. Noticing that 

— 1 |-0 It pi Ot 

RAE - j j where R = j | , it can be concluded froir 

il 0-» 1-0 2-» 

Theorem 3.4, part (viii) of Theorem 3.6 and Theorem 3.9 that 

the eigenvalues of A are 1 and -1. It is important to 
notice that the reason the power method does not converge is 
that A is cyclic. 

Another problem with the power method involves the 
choice cf Z. For example, if the sum of the components of 

is zero, then Z = (1,1,..., 1) will lead to difficulties. 

Since is unknown prior to application of the power 

method, there is often no assurance that any particular 

choice of Z will avoid this complication. However, if A is 
ncnnegative and irreducible. Theorem 3.6 parts (ii) and 

(iii) guarantee that V has no zero components and that all 

1 

components of V have the same sign. Thus, both normalizing 

1 

procedures mentioned earlier in this subsection are 
guaranteed tc avoid such difficulties. 

These same parts of Theorem 3.6 can also be used to 

help avoid the problem of choosing a starting vector which 

is orthogcnal to V . Since all of the components of V have 

1 1 

the same sign, any vector orthogonal to must have seme 
components with positive real part and some with negative 

(0) 

real part. So if X has all nonnegative (or all 



I 
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nonpositive) components, X cannot be orthogonal to V . 

1 

As a final comment concerning the power method note 
that the purpose of the normalization procedure is tc insure 
that the method converges to a nontrivial value. Without 
the normalization procedure (3,22) becomes; 



(3.24) 



m 

(£) S S 

X = t 5 (t yt ) c .V. 

^ . 3 1 3 3 



The sum on the right hand side of (3.24) is the same as the 

sum on the right hand side of (3.22) and approaches the same 

. . s 

limit as s becomes large. However, t approaches zero if 



'S' 



< 1 or infinity if |t | > 1. If |t | = 1 but t 1 



1 



1 



then t^ = exp{yi} for some real number y which is net an 

(s + 1) 

integer multiple of 2n. In this case for large s, X 
(£) 

approximately. That is, the magnitude of the 



vectors generated by the power method will converge, but 

there will be an angular displacement of y radians between 
successive vectors for large s. As indicated at the end of 

subsection 3.1, t^ = 1 for the cases of interest in this 

thesis. In such cases normalization is not required fer 

convergence cf the power method. On the other hand, if the 
convergence criterion used in step (iv) of the power method 
is based upon an absolute difference rather than a percent 
difference, the normalization procedure (or lack thereof) 
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will affect the number of iterations necessary to attain 
convergence and the degree to which the convergence 
criterion is satisfied by the final answer. 



For example, suppose that the convergence criterion 

(£) (S-1) -4 

is; max |x -x | < 10 . Suppose that after k steps 

i i i 

(k) (k-1) -4 

it is found that irax lx -x | = 0.8 x 10 . Then, the 

i i i 

(k) (k) 

method has converged and P = X /(ZX ) is the desired 



solution to the system (3.1), (3.3), (3.4), where Z = 

(k-1) (k-1) 

(1 , 1, . . . , 1) . Eut, xf Q = X / (ZX ) , then P = AC 

-4 (k) 

(approximately) and max Ip -g I = (0.8x10 ) / (ZX ). Mote 

i i i 

(k) -4 

that if ZX < 0.8, max |p -g 1 > 10 ; i.e., more 

i i i 

iterations should have been performed. By way of contrast, 
(k) -5 

if ZX > 8.0, max I p -g I < 10 and fewer iterations were 

i i i 

probably reguired. Now if c is chosen so that ZV = 1, 

j 3 

(k) T (0) T 

then ZX =c=7X /(VV) which is highly dependent 

11 11 

upon the relationship between the starting vector and the 



desired scluticn vector P = (see eguations (3.24) and 

(3.20)). Since V is unknown, this relationship is also 

1 



unknown. 
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4.2. Th€ Relaxed Power M eth od 



The power method is homologous to the Jacobi method 
for iterative solution of a system of linear equations. The 
method discussed in this subsection is homologous to the 
relaxed Jacobi method. Young [114], Chapter 3, is suggested 
as backgrcunc reading for the subject of this and the 
following twc subsections. 

The relaxed power method yields at each iteration a 
weighted sum, or average, of the vector calculated in the 
power method and the preceding vector from which it was 
calculated. To specify this method replace step (ii) cf the 
power method by: 

(i) (i-1) (i-1) 

(ii) Calculate Y = wAX + (1-w) X 

Computationally, this is accomplished in the 
following manner: For j = 1,2,...,n, 



(3.25) 



( 1 ) 

y . 

3 



= w ? 
k=1 



(i-1) (i-1) 

ax + (1-w)x 

jk k j 



Usually the real number w is chosen at the start (i.e., step 
(i) ) , though it could conceivably be changed at each 
iteration . 

The conceptual difference between the power method 
and the relaxed power method is best illustrated in terms of 
an analogous situation. Consider attempting to determine a 
solution for an equation f (x) = 0. Using some iterative 
method (e.g., Newton's method), a sequence of numbers 

{x ,x ,x ,...} is generated which converges to a number x 
0 1 2 
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I 

K- 

I 



i, 

r. 



I 







such that f(x) =0. This is done by choosing a starting 

value X and, for i = 1,2,3,.,., letting x = g (x ) where 
0 i i- 1 

g is a function somehow related to f. Now consider an 

attempt to accelerate the convergence by developing a 

sequence {y ,y ,y ,...} where y = x and, for 1 > 1, y = 

• 0 1 2 0 0 j 

wg (y ) + (1-w)y • How could w be chosen so that the 

j-1 j-1 

y 's converge to x more rapidly than the x * s? If the 
i i 

function g is known xo consistently overshoot the mark 

(i.e., if X is known to always lie between g (y) and y) , then 
choosing w between zero and one will always put y between 

j 

g(y ) and y and (hopefully) therefore closer to x. On 

j-1 j-1 

the other hand, if g is known to consistently undershoot rhe 
mark, then a choice of w > 1 will push y beyond g (y ) and 

j j- 1 

(again, hopefully) closer to x. 

Similarly, the relaxed Jacobi and relaxed power 

(i) 

methods seek to choose w so that the resulting X at each 

iteration is closer to the vector sought than it would be if 
w = 1 (i.e., no relaxation) . The method is said to be an 

underrelaxed method if 0 < w < 1 and an overrelaxed method 
if w > 1, The question of whether to underrelax or 
cverrelax in a particular situation seems to be related to 
the values of the eigenvalues of the matrix in question. 
Since these are generally not known, the question of what 
value to give w is usually a difficult one. Wallace and 
Rosenberg [108] use a relaxed power method in their 
recursive queue analyser program. They suggest w = 1 . 3 as a 
gocd choice in many of the problems they have solved. The 
choice or w for k-cyclic matrices is investigated in the 
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remainder of this subsection. 



Notice frcm the revised step (ii) above that the 
relaxed power method is the power method applied to the 

matrix B = wA + (1-w) I. If A is a nonnegative, k-cyclic 

w 

(irreducible) matrix, then B is irreducible (all nonzero 

w 

positions in A are still nonzero in B ) , is acyclic if w * 

w 

1, and is ncnnegative if 0 < w < 1 . So B can be viewed in 

w 

terms of Earrcn-Frobenius theory if 0 < w < 1. However, 

mere information can be obtained by directly examining the 

relationship between the eigenstructures of A and B . 

w 

Let t be any eigenvalue of A and let V be any 
i 3 

right eigenvector of A corresponding to t . Then, B V = 

3 w j 

[wA + (1-w)I]V = WAV + (l-w)IV = wt V + (1-w)V = 

j j j j j j 

[wt + (1-w) jV . That is, V is a right eigenvector of B 

j j j » 

corresponding to eigenvalue [wt + (1-w) ]. In essence, the 

3 

relaxaticE leaves the eigenvectors untouched and replaces 

each eigenvalue t by a weighted sum of t and one, g (t) = 
wt + (1-w). Note that t = 1 is the only fixed point for the 
transformation g. If 0 < w < 1, g (t) is closer to one than 
t is. If w > 1, g (t) is further away. (If w < 0, g (t) is 
on the other side of one.) This transformation of t by g is 
performed along a straight, line in the complex plane passing 
through t and one. 

New consider the case in which the coefficient 
matrix A is k-cyclic. Let t be any eigenvalue of A such 
that |t| = 1 but t # 1. (Note that Theorem 3.6 guarantees 
the existence of k - 1 such eigenvalues.) Since the 
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straight line in the complex plane passing through t and one 
intersects the unit circle only at t and 1 , g (t) will te 
outside the unit circle if w > 1 (or w < 0) . That iS/ if w 
> 1, one is no longer a dominant eigenvalue and the power 
method will net converge to the desired eigenvector. Cn the 
other hand, if 0 < w < 1, all of the eigenvalues on the unit 
circle (except one) are drawn into the unit circle as they 
are drawn closer to one, and one is the unique dominant 

eigenvector of B . 

w 

Now consider more specifically the effects of this 

transf or maticn on the eigenvalues of A. Suppose w is any 

real munter between zero and one (noninclusi ve) and let t = 

(x,y) be any eigenvalue of A other than (1,0). (The ordered 

pair notation for complex numbers is used here.) The 

eigenvalue cf B corresponding to t is g (t) = wt + (1-w) = 

w 

(wx+1-w,wy). The question is whether g(t) is closer to the 

origin or further from the origin than t. Sc, note that: 

ig (t) I < iti 

2 2 2 2 2 
iff (WX+1-W) + w y < X + y 

2 2 2 2 2 
iff (1-w )y + (1-w )x - 2w(1-w)x - (1-w) > 0 

2 2 

iff y + X - 2[w/(l + w)]x > (1-w)/(1+w) 

iff y^ + [X - y/(1 + w) ] > 1/(1 + w) 

From this it can be seen that the transformation will draw 
closer to zero only those eigenvalues outside the circle in 
the complex plane with center at (w/(1+w),0) and radius 
1/(l+w). Those eigenvalues inside this circle will be drawn 
further from the origin. As depicted in Figure 3.1, this 
circle is inscribed within the unit circle, touching it only 
at (1,0). Since 0 < w < 1, 1/2 < l/(1 + w) < 1. Hence, the 

origin is always within the circle and the circle contains 
at least cne-feurth the area of the unit circle. This type 
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Figure 3.1- -Effects of the Relaxed Power 
Method on the Eigenvalues of 
Magnitude Less Than One 
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I 



! 



of informaticn is sufficient to determine the best the case, 
value for w (in terms of the convergence properties of the 

power method on B ) if all of the eigenvalues of A are 

w 

kncwn. Dnf ortunat ely , this is seldom Precise theorems on 

the convergence properties of the relaxed power method are 
very difficult to determine. 

One exception is the case in which A is a k-cyclic 
matrix and one is a dominant eigenvalue of A. Since k of 
the eigenvalues of A are known (they are the k-th roots of 

unity) , the eigenvalues of B corresponding to these can be 

w 

used to provide a lower bound for the measure of convergence 

of the power method, |t^/t^|. In Figure 3.2, point d is at 

the eigenvalue of A on the unit circle closest to point a, c 

is at the eigenvalue of S corresponding to the eigenvalue 

w 

of A at d, and b is at the midpoint of the line segment ad. 

In radians the angle aod is 2n/k, angles aob and bod are 

each half of this or n/k, angle cod is w(2a/k), and angle 

aoc is ( 1-w) (2n/k) . Line segments ao and od each have 

length one, and line segments ad and ob are perpendicular. 

From this it can be concluded that line segment ob has 

length cos fn/k) and line segment oc has length: 

h = [cos (n/k) ]sec[n/k - (2n/k) min (w, 1-w) ] 

Since point c is at an eigenvalue of B , 1 t /t | = i t | > 

w 2 1 2 

h. Choosing w to m-inimize h may relax this lower bound 

without improving the convergence properties. Note that 
this bound is useless. if k = 2. 
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Figure 3.2--E£fects o£ the Relaxed Power Method 
on an Eigenvalue o£ Magnitude One 



188 



4.3 



• The Gauss-Seidel Me tho d 



Returning to the steps of the power method as set 
forth in subsection 4.1, note that at each iteration all of 
the components of the old vector are used to determine each 
component of the new vector. Computationally this means 
that the old vector must be retained in storage at least 
until the new vector has been completely determined. 

However, note that at each iteration, step (ii) is 
accomplished by, in sequence, applying (3.18) with j = 1, 
then with j = 2, then with j = 3, and so forth. In other 



words, fcr j > 1, 




is known for k 



2 , 



j. - 1 and 



could be used in the place 



of 



(i-1) 

X 

k 



in calculating 



( 1 ) 

y . • 



This is, in effect, done if (3.18) is replaced by 



j- 1 n . 

(3.26) y = Z^y +^ax 

j jk k jk k 

k=1 k=j 



This observation is the basis of the Gauss-Seidel method. 

Consider the matrix manipulations necessary to 
replace (3.18) by (3.26). The matrix A can be uniquely 
written as A = 0 + L where U has all zeros below the main 
diagonal and L has all zeros on and above the main diagonal. 
The collection of equations specified by (3.26) for j = 1/ 

2, ..., n is represented in matrix notation as 

. (i) (i) (i-"') 

(3.27) y = LI + UX 
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since L has all zeros on and above the niain diagonal, I - L 

is nonsingular. Collecting the terms involving ^ and 

-1 

multiplying through by (I - L) (3.27) becomes 



(i) -1 (i-1) 

(3.28) 1 = (I-L) UX 

Cef i ni tion 3. 8 Let A be a square matrix. Then the 
-1 

matrix G =- (I-L) U is the G au s s-Seide l iter ative m^rix of 

A where L and U are the unique matrices such that L has all 
zeros on and above the main diagonal, 0 has all zeros below 
the main diagonal, and A = L + U. 

The Gauss-Seidel method is the power method applied 
to the Gauss-Seidel iterarive matrix of A. To specify this 
method replace step (ii) of the power merhod by: 

(i) (i-1) 

( 11 ) Calculate Y = GX where G is the 

Gauss-Seidel iterative matrix of A. 

Computationally, this is accomplished by 
successively applying (3.18) with j = 1 (3.26) 

with j = 2, 3, ..., n. 

Very little is known about the relationship between 
the eigenstr uctures of A and its Gauss-Seidel iterative 
matrix, G. It can be shown that G is nonnegative if A is 
ncnnegative. Seneta [99] shows that if A is nonnegative, 

the Perr cn-f robeni us eigenvalues r (of A) and r (of G) 

A G 

have one of the following relationships: 

fi) 0 < r < r < 1 ; or 
G A 

(ii) r = 1 = r ; or 
A G 
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(iii) 1 < r < r . 

A G 

In the cases cf interest in this thesis, of course, (ii) 
applies. In fact, there is some hope that the Gauss-Seidel 
method will converge to the desired eigenvector, as can be 
seen in the following theorem. 

Let A = U + L be any square matrix and 

- 1 

let G = (I-i) U be the Gauss-Seidel iterative matrix cf A. 

Then one is an eigenvalue of A and X is a right eigenvector 
cf A corresponding to one iff one is an eigenvalue of G and 
X is a right eigenvector of G corresponding to one. 

• X = AX = (U + L) X 
iff (I-L) X = UX 

-1 

iff X = (I-L) UX = GX. 

Add to theorem 3.10 the result, extracted from 
Seneta, that one is the Perr on-Frobenius eigenvalue of G in 
the cases of interest in this thesis, and indeed there is 
some hope that the Gauss-Seidel method will converge to the 
desired eigenvector of A. One primary problem still 
remains: G may be cyclic. That is, the Gauss-Seidel method 

may not converge. The following theorem, which is perhaps 
the most striking result known about the relationship 
between the eigenstructures of A and G, considers this 
problem in the case that A is k-cyclic. 

Th eo rem Let A be a k-cyclic irreducible matrix 

of order n in first canonical form (as specified in (3.3) 

with il = I) . Let G be the Gauss-Seidel iterative matrix of 

A. Then, if t , t ,..., t are the distinct eigenvalues of 
12 m 

A numbered sc that (3.19) is satisfied, and if 
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k 

(i) Except as specified in (ii) , t is -aji 



eigenvalue cf G iff t is an eigenvalue of A. 

(ii) Zero is an eigenvalue of G (even if it is 
not an eigenvalue of A) , and every nonnull vector 
I IT T 

X, such that X = (X ,X ) where X = 0 , is 

12 k k ■ 



an eigenvector of G corresponding to zero. 



(iii) The Gauss-Seidel method will converge 
(barring an unfortunate choice of starting vector) 

k 

to an eigenvalue w = t and an eigenvector X, such 



I T T T 

that X = (X ,X ,...,X ), where X is an 
12k k 



eigenvector of B = A A ...A corresponding to w 

k k-1 1 



(see Theorem 3.7) and for i = 1,2,...,k-1, X = 

i 



(1/w) A A ...AX. 
i i-1 Ik 



First determine the form of G in terms of 
the submatrices of A. 
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I 



u = 



I - L = 



r0 

n 

1 




• • • 


iZJ 


*, • 


0 


0 

n 

2 


• • • 


0 


0 


0 


0 


• • • 


n 

k-1 


0 


0 

L 


{2f 


• • • 


0 


0 

n 

k-* 


T 

n 

1 


0 


0 


« • • 


0 


-A 

2 


I 

a 

2 


0 


• • • 


0 




-A 

3 


I 

n 

3 


• • • 


0 








• • • 


-A 

k 



kJ 



(I - 



-1 



L) 




I 

n 

1 

A 

2 



A A 
3 2 







A 

3 



^ ... ji 0 - 

0 •.. 0 0 

X m m m ^ 0 

n 

3 



AA •••A AA aavA AA aa^A aaaA X 

k k-1 2 k k-1 3 k k-1 4 k n 

k-* 
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I. 



-1 

G = (I-I) U = 



r0 


0 


• • • ^ 


A 1 


n 






1 


1 






0 


0 


• • • 0 


A A 




n 




2 1 




2 




0 


0 


• • • 0 


AAA 






3 2 1 


0 


0 


. • • 0 


A A ... A 


L 






k k-1 1J 



IT IT T 

Let X - X = (X ,X ,...,X ) be any vector of length 

12 }c 



n. Then, 



(3.29) 



GX = 



A X 
1 k 

A A X 
2 1k 

A A A X 
3 2 1 k 



A A ...AX 
L k k-1 1 kJ 



which is independent of X , X , ..., X . Note from 

12 k-1 

(3.29) that w is an eigenvalue of G and X # C is a right 



eigenvector cf G ccrrasponding to w iff 

wX = A A ... A X 
i i i-1 Ik 

for i = 1,2,...k. In parricular, wX = A A ...A X = BX . 

k k k-1 1 k k 

Part (i) now fellows from Theorems 3.7 and 3.3. Part (ii) 

follows immediately since X =0 implies GX = 0 = OX. By 

k 

ic }c ic )c 

Theorem 3.8, t = t = ... = t . Since |t j > |r I, t 

12 k k k+1 1 



is the unique dominant eigenvalue of B. This is sufficient 
to prove (iii) . 
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The proof of Theorem 3.11 relies heavily upon the 
hypothesis that A is in first canonical form. It is natural 
to ask if the results (particularly part (i) ) are 
independent of this assumption. For k > 2 the answer is 
negative. In fact^ a proof similar to that of Theorem 3.11 
reveals that G is cyclic of order k-1 (though not 
irreducible) if A is in second canonical form. The 
following example illustrates that point. 

Example 3.2 

rO 1 0-, 

Let A = jO 0 1). Then, A is 3'-cyclic and irreducible. 
1-1 0 QJ 

fO 1 0-, r 1 0 0-, -1 r1 0 C-, 

Also U = 10 C 11,, I - L = 1 0 1 0| , (I - L) = 10 1 0|, 

•■0 0 OJ «--i 0 1J j-1 0 1J 

f C 1 0-j 

and so G = 1 0 0 1 1 . G is seen to have period 2. 

•-C 1 OJ 

In the case that k = 2, the first and second 
canonical forms coincide. Hence, an example similar to 
Example 3.2 will net yield a negative result. For this 
case: 



££S 3 i 2 t^e: If A is a 2-cyclic, irreducible matrix, 

and if G is the Gauss-Seidel iterative matrix of A, then t 

2 

is an eigenvalue of A iff t is an eigenvalue of G, except 

that zerc is an eigenvalue of G even if it is not an 
eigenvalue cf A. 

If A is in first canonical form, this result is true 
by Theorem 3.11. The following theorem lends plausibility 
to the conjecture by showing that it is true if A is in 
third canonical form. 
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Theorem 3^22 Let A be a 2-cyclic matrix in third 
canonical form as specified by (3.9), and let G be the 
Gauss-Seidel iterative matrix of A. Then 

(i) zerc is an eigenvalue of G and any nonrrivial 

T T 

vector T such that Y = (Y^, C, 0, 0) is an 



eigenvector cf G corresponding to zero; and 



(ii) t # 0 is an eigenvalue of A and X is an 



eigenvector of A corresponding to t, where X = 



IT T 2 

(X , X , X ), iff t 0 is an eigenvalue of 

12k 



G and Y is an eigenvector of G corresponding to 



2 T IT2T k-lT 

t , where Y = (X^,tX^,t X^,...,t X^) . 



Proof ; First determine G in terms of the 
sutmatrices cf A. 



r 0 A 0 ...0 0 1 



n 

1 


12 








ii 


n 
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A • • • 

23 


0 


0 


0 


0 


0 • • • 

n 

3 


0 


0 


0 


0 


0 9mm 


0 

n 

k-1 


A 

k-1 ,k 


0 

. 


0 


0 9 9 9 


0 


0 

n 

k J 
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I 







/ 



I - L 



-1 

(I-L) 



r I 

n 

1 


0 


^ • • • 


0 


0 


0 1 


-A 

21 


I 

n 

2 


^ • • • 


0 


0 


0 




-A 

32 


I 

n 

3 


0 


0 


0 




0 


... 


-A 

k-1,k- 


I 

2 n 

k-1 


0 






^ ... 


0 


-A 

k,k-1 

i 


I 

n 

kJ 


I 

n 

1 




0 


0 


• • • ^ 


0 


A 

21 




I 

n 

2 


0 


• • • ^ 


0 


A A 
32 21 




A 

32 


I 

n 

3 


• • • 0 


0 



k-1 



k-1 



k-1 



T7 A 

1=2 i,i-l 



TT A. . 

1=3 1 , 1-1 



TT A. , ^ 

1=4 1 , 1-1 



I 0 

n 

k-1 



TT A TT A TT A 

1=2 i,i-1 1=3 i,i-1 i=4 i,i-1 



A I 

k , k- 1 n 

kJ 
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r0 A 0 ... 0 

n 12 

1 



0 


A A 
21 12 


A 

23 




0 




AAA 
32 21 12 


A A 
32 23 




• • • 0 




k-1 


k-1 








■ (IT A. . ) A 

1=2 i,i-1 12 


(TT A. . 
1=3 1,1-1 


) A 

23 


... A 

k-1 ,k 




k 


k 






0 


(TT A. . )A 
1=2 1,1-1 12 


(TT A. . 
1=3 1,1-1 


) A 

23 


... A A 

k,k-1 k-1, k-1 



Frcm this representation of G, (i) follows directly. 

Now suppose that w # 0 is an eigenvalue of G 

T 

and Y is an eigenvector of G corresponding tc v, where Y = 
IT T 

(Y ,Y ,Y ). Then, wY = GY implies: 

12 k 



(3.30) wY = A Y 

1 12 2 



and, for j = 2,3,...,k-1, 



3 j 



(3.31) wY = A Y + 5 (TT A )A Y 

j 3f j'*'1 j'*'”' ~ i/i~1 s-1,s s 

s=2 i=s 



One conseguence of (3.31) is that, for j = 3,4,...,k, 
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(3.32) 



w£ Y 



j/j-1 j-1 



j-1 j-1 




s=2 i=s 



) i Y + A Y ] 

s-1 ,s s j-1 / j j 



j j 



2 (TT A. 



) A y 

i,i-1 s-1,s s 



s=2 i=s 



Thus, for j = 3,4, ,k-1, (3.31) and (3.32) result in: 



which is also seen to hold for j = 2 as a special case of 
(3.31). Finally, wY = GY and (3.32) imply: 



The relations (3.30), (3. 33) and (3.34) are a set of 
necessary and sufficient conditions for w 0 to be an 
eigenvalue cf G and Y an eigenvector of G corresponding to 

w. 

The following similar set of necessary and 
sufficient conditions for r 0 to be an eigenvalue of A and 

T 

X an eigenvector cf A corresponding to t, where X = 

IT I 

(X ,X ,...,X ), can be obtained directly from tae form of A 
1 2 k 

as specified in (3.9): 



(3.33) 



wY = A Y + wA y 

j j/j+1 j+1 j, j-1 j-1 



(3.34) 



Y = A Y 

k k,k-1 k-1 



(3.35) 



tX = A X 
1 12 2 
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(3.36) 



+ A X 

3 . 3-1 3-1 



for j = 2, . , 



k-1 



tX = A 



3. j+1 3 + 1 



(3.37) tX = A X 

k k,k-1 k-1 



A simple verification that the conditions (3.30), (3.33) and 

(3.34) are equivalent to the conditions (3.35), (3.36) and 

2 i-1 

(3.37) if w = t and Y = t X (for i = 1,2,...,k) new 

i i 

completes the proof. 



Corcllary Let A be a 2-cyclic, irreducible matrix in 

third canonical form. Ift,t, ...,t are the distinct 

1 2 m 

eigenvalues cf A numbered so as to satisfy (3.19), and if 



(t^l > |t_|, then the Gauss-Seidel method will converge to 
2 3 

2 T 

an eigenvalue w = t and an eigenvector X, X = 



11 I T-1T-2T 1-k I T 

(X ,X ,...,X ), where (X ,t X ,t X ,...,t X ) is an 

12 k 1i2i3 ik 



eigenvector cf A corresponding to t for i = 1,2. 

i 



Pro of; Since |t | > it | 

2 3 

unique dcninant eigenvalue of the 
matrix of A by Theorem 3.12. 



2 

and w = t = 
1 



Gauss-Seidel 



2 

t , w is the 
2 

iterative 



Ifae importance of Theorem 3. 12 and the corollary 
following it is brought into focus by the -realization that 
arrangement cf the states according to a lexicographic 
ordering cf the ISTATEmI or the ISTATEm2 vector 
representation results in a coefficient matrix A in third 
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canonical form for many of the 2-cyclic models discussed in 
the preceding chapter. For example, consider any central- 
server model in which all service distributions are 
exponential and no blocking or bulking can occur. At each 
transition the number of jobs at the CPU either increases by 
one or decreases by one. The result (see Appendix A) is 
that each such model is 2-cyclic. An ordering of the states 
which yields a coefficient matrix in first canonical fcrm 
will necessarily list all of the states having an odd number 
of jobs at the CPU followed (or preceded) by all of the 
states having an even number of jobs at the CPU. the 
ISTATEmI and ISTATEm2 orderings list all states with N jobs 
at the CPU, followed by those with N - 1 jobs at the CPU, 
followed by those with N - 2 jobs at the CPU, and so forth. 
Since the number of jobs at the CPU changes by exactly one 
with each transition, the resulting coefficient matrix is in 
third canonical form. 

The corollary to Tneorem 3.12 says that the 
Gauss-Seidel method will converge for these models. Note 
that since the Perron-Frobenius eigenvalue of A is one in 
these cases, the Perron-Frobenius eigenvector of G coincides 
with that of A. Thus, the Gauss-Seidel method will converge 
to the sought-after eigenvector. Because of this and the 
ease with which this method is programmed, the Gauss-Seidel 
method is the one used in the models programmed by the 
author (see Chapter IV) . 

4.4. Accel era ti ng C onvergence of the Gau ss-Seid el 
Met hod 

Note that Theorem 3.11 guarantees convergence cf the 
Gauss-Seidel method if A is k-cyclic and in first canonical 

k 

form. The measure of convergence is |t /t | by part (i) , 

k+ 1 1 
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in contrast to the to the relaxed power method where this 
measure must he expressed in terms of all of the eigenvalues 
of A. In the relaxed power method the dominant eigenvalues 
of A are transformed to distinct eigenvalues of the 

iterative matrix B . Under the conditions described in 

w 

Theorem 3.11, they are all transformed to the same 

(dcoiinant) eigenvalue cf the Gauss-Seidel iterative matrix G 
and, thus, will not affect the rate of convergence, as they 
could in the relaxed power method. 



It wculd be natural to next ask if it is possible to 
improve the convergence properties of the Gauss-Seidel 
method by incorporating a relaxation procedure. Indeed, 
since the first two eigenvalues of G are distinct, a choice 
of w > 1 may be appropriate. Incorporation of a relaxation 
procedure may take place in at least two ways. 



First, a weighted sum of the old vector and the new 
vector may be taken at each iteration. To specify this 
method, replace step (ii) of the power method as specified 
in the preceding subsection and replace step (iii) by; 

<i) (i) (i) 

(iii) Let b = ZY and X = (w/b ) Y + 
i i 



(1-w)X 



(i-1) 



(i) (i~1) 

Note that X will be normalized since both X and 



-1 (i) 

b Y are. 
i 



Second, a weighted sum may be taken as the new 
vector is generated. To specify this method, replace step 
(ii) by: 

(i) 

(ii) Calculate Y by letting 
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(i) 



n 



= w a X 
~ Ik k 
k=1 



(i-1) 



+ (1-w)x 



(i-1) 

1 



and, for j = 2,3,...,n, letting 



(i) 
y .■ 

3 



3-1 n 

, ^ (1) ^ (i-1) (i-1) 

«[ Z a y + ? a X j + (1-w) x 

3k k 3k k -j 

k=1 k=3 



Neither cf these two methods has been investigated, either 
analytically or empirically. 



Another promising acceleration technique is informed 
choice of starting vector. Note from (3.20) that choice of 

( 0 ) 

startrng vector X as nearly parallel to eigenvector as 

possible will cause small values of c ,c ,...,c and earlier 

2 3 m 

convergence cf the iteration method. Such an informed 
choice must be based upon the physical situation modelled 
and the anticipated properties of the final solution. 

Consider now an irreducible central-server model as 
discussed in the preceding chapter. Perr on-Frobenius theory 
tells us that the solution vector (to (3,1), (3.3) and 

(3.U)) is strictly positive; that is, all components have 
positive values. As remarked in subsection 4.1, a choice of 
(ncnnull) starting vector with no negative components will 
guarantee convergence of the power method if the model is 
acyclic. In the case of the relaxed power method, any 
nennegative starting vector will guarantee convergence if 0 
< w < 1, and in a wider range of values for w if the modal 
is acyclic. Theorems 3.11 and 3.12 indicate that more care 
must be taken in the choice of a starting vector for the 
Gauss-Seidel method. However, a positive starting vector is 
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sufficient fcr all three methods. The starting vector X 

T 

= (VQ)J. = ( 1/n, 1/n . ,1/n) is one such choice which is 

easy to iaplement. Of course, this is an arbitrary choice 
and consideration of the model parameters may lead to better 
choices. 



For example, a product-form solution such as that 
discussed by Gordon and Newell [46] can provide a starting 
vector. Gaver [38] has investigated Gordon and Newell's 
fcrmulaticn for the central- server model with a single job 
type and exponential servers. For this case the 
steady-state probability that the system is in a given state 

J1 J2 JN 

is proportional to r r ...r where for each m = 

1 2 N 



1,2,...,M, Jm is the number of jobs at PPm in the given 



state and r is the product of the service rate at the CPU 
m 

and the branching probability to PPm divided by the service 

rate at PPm. Since, in general, different job types have 
different service rates and branching probabilities, several 
states of a system such as those described in the preceding 
chapter have J1 jobs at P?1, J2 jobs at PP2, ..., JM jobs at 
PPM, and some technique must be devised for developing a 
starting vector from a product-form solution. One technique 
is to use composite, or averaged, service rates and 
branching probabilities. Once the product indicated above 
has been determined, it can either be assigned to each state 
having Jm jobs at PPm (m = 1,2,...,M) or split among them. 

Jm J 1 m J2m 

Another technique is to replace r by r r where Jim is 

m 1 m 2ra 

the number of type-i jobs at PPm and r is the product of 

im 



I 
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4 




'Si 



I 



the service rate of type-i jobs at the CPU and the branching 

probability of type-i jobs to PPm divided by the service 
rate of type-i jobs at PPm. 

Another example of how system parameters might be 
used to provide a good starting vector is to base the 
relative values of the components of the starting vector 
upon the following considerations: 

(i) If the service rates at the CPU are large as 
compared to those at the PP*s, and if II is 
relatively small, there is a smaller likelihood of 
finding a large number of jobs at the CPU than a 
small number. The opposite is true if the service 
rates at the CPU are of the same order as or 
smaller than those at the PP's. 

(ii) If the branching probability of type-i jobs 
to EPj is significantly smaller than that of the 
same type jobs to PPk, there is a smaller 
likelihood of finding a given number of type-i 
jobs at PPj than finding the same number at PPk. 

(iii) If the service rate of type-j jobs is 
smaller than that of type-k jobs at a given 
processor, there is a smaller likelihood of 
finding a type-k job in service there than a 
type-j job. 

These guidelines are purposefully vague and no 
indication has been given as to how they might be used to 
develope a starting vector. The question of acceleration of 
the Gauss-Seidel iterative method, in general, and choice of 
a ’’good*’ starting vector, in particular, is an open area for 
research . 
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IV. 



PROGRAMMED MODELS 



This chapter is devoted to a discussion of three models 
which have been programmed using the techniques developed in 
Chapter II. The programs have been written in FORTRAN and 
are currently operational on the IBM 360-67 at the Naval 
Postgraduate School in Montery, California. In the first 
section the acdels and programs are discussed. The second 
section contains the results of a variety of numerical 
examples designed to contrast the models to each other and 
to approximate product-form solutions of some of the same 
problems . 

1. DESCRIPTION OF MODELS AND PROGRAMS 

The first subsection of this section contains a 
discussicn of the models which have been programmed. The 
second subsection considers the similarities in the three 
pregrams, and the final subsection considers their 
differences. 



1.1. The Mod els 

All three models are closed central-server models as 
depicted in figure 1.1. Service distributions at all 
processors are exponential. Two types of jobs are 
considered, and service rates and branching probabilit ies 
depend upen job type. 
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The first model is the key example discussed in 
Chapter II. Each processor is a FCFS, single server queue. 
Hereafter, this model is called the FCFS model. Appendix D 
contains a listing of the FORTRAN program associated with 
this model. In the remainder of this chapter this program 
will be referred to as the FCFS program. 

The second model is identical to the FCFS model 
except that the CPU has a PS queuing discipline. For this 
reason this iicdel is hereafter called the PS model. A 
listing cf the associated program, the PS program, is found 
in Appendix E. 

In the third model PP1 is an IS queue which services 
only type-twc jobs (ALFA (1,1) = 0, to use notation 
introduced in the next subsection) , and the CPU is a 
multiserver FCFS queue. An application of this model will 
be discussed in section 3 of Chapter VI. In this 
application PP1 represents all of the tape drives in a 
computer system, and the multiserver CPU represents multiple 
CPU's with a common waiting line. Only type-two jobs 
require the use of tapes. This model is hereafter called 
the tapes model. A listing of the tapes program is found in 
Appendix F. 

Having introduced the models, attention is now 
directed toward the programs. 

1.2. Simi larit ies in the Pr ogra ms 

The major variable names are common to rhe three 
programs. Appendix G contains a glossary of these variable 
names. Several are also defined in this subsection. ethers 
have already been used in Chapter II. 
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The Dumber, NPP, of FP's is an input parameter in 
all three programs. (In previous chapters this number has 
been represented by M, but in the programs it is NPP.) NPP 
is restricted to be no smaller than two and no larger than 
nine. The Icwer bound is a natural restriction in the tapes 
model (the type one jobs have to route somewhere when they 
complete service at the CPU) and is necessary for 
irreducibility in the FCFS model. The upper bound is to 
help insure that the right subvector (see Chapter II, 
subsection 3.1) can be stored as a single- precision integer. 
This bound can be relaxed, but not without a careful 
reprogramming effort. 

The number, N1, of type-one jobs and the number, N2, 
of type-two jobs are input in all three programs. Both N1 
and N2 are restricted to be nonnegative and their sum must 
not exceed nine. The upper bound is to insure that the left 
sutvector (see Chapter II, subsection 3.1) can be stored as 
a single-precision integer. Together with the upper bound 
on NPP, the upper bound for N1 + N2 insures that the right 
subvector can also be so stored. 

The tapes program will reject any case in which N1 = 
0 or N2 = 0. If N1 = 0, the FCFS program and the PS program 
will decrease N2 by cne, increase N1 to one, replace the 
service rates and branching probabilities for type-one jobs 
by those for type-two jobs, and solve the problem as though 
there were actually two types of jobs. (The case in which 
N2 = 0 is treated similarly.) Because of this N1 + N2 is 
restricted tc be no smaller than two. The reader is 
cautioned, however, that output referring to job type (for 
example, the probability that PP2 is busy with a type-cne 
jot) loses meaning when N1 = 0 or N2 = 0 initially. Note 
that any case in which there is only one job type can be 
solved using a product— form solution. For the FCFS model 
the reader is referred to Gaver [38]. For the other two 
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models see Easkett, Chandy, Muntz and Palacios [6]. 

For each job type i = 1,2 and each PPj, j = 
1,2,...,NPP, the branching probability, ALFA(i,j), of a type 
i job to PPj is also an input parameter. If, for some j = 

-6 

1, 2, SHE, ALFA(i,j) < 0.5 x 10 for both i = 1 and i 

= 2, then PPj is eliminated from consideration, the PP's are 
renumbered (if j ^ NPP) , and NPP is decreased by one. NPP 
must satisfy the restrictions indicated above after all such 

-6 

trimming has taken place. If |ALFA(i, j) 1 < 0.5 x 10 for 

some i and j, then ALFA(i,j) is set equal to zero. If any 
ALFA(i,j) is negative, an error message is printed and the 
problem is net solved. 

For each job type i = 1,2 and each processor j = 1, 

2, ..., NPP + 1 (processor NPP + 1 is the CPU), the service 
rate, RAIZ(i,j), of type i jobs at processor j is an input 

-6 

parameter. If any such rate is smaller than 0.5 x 10 , an 

error message is printed and the problem is not solved. 

Note that tests are performed in the order that the 
restrictions are presented here. Hence, a negative 
branching probability or service rate will not cause 
abortion of the problem if the affected PP is eliminated, or 
if it is for type-one jobs and N1 = 0. 

Each program is divided into a main routine and 
seven subroutines: 

MAIN: Controls flow of the program; contains read 

statements for input; develops balance equations. 

CKOUT: Checks input parameters for adherence to the 

above restrictions. 

INIT: Stores the right and left subvectors and 
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associated pointer vectors. 

LOOKUP: Calculates the state number from the vector 

representation. 

PIG: Calculates the number of combinations of n + 

1 

n things taken n at a time. 

2 1 

GSSOL: Selves the balance equations. 

EENTE: Prints the steady-state probability 

distributions (no more than one thousand states) . 

ACCUM: Calculates and prints the measures of system 

performance. 

Subroutines PIG, GSSOL and PRNTP are the same for all three 
programs. CKCDT is the same in the FCFS and PS programs. 
Although the coding differs in the tapes program, the same 
tests are performed in CKOUT of the tapes program. 

Ihe Gauss-Seidel iterative method, discussed in 
subsection 4.3 of Chapter III, is used in GSSOL to solve 
the balance equations. Each component of the starting 
vector is equal to 1/NSTATS, where ESTATE is the number of 
states (and balance equations) . The method is considered to 
have converged if 



(4.1) max 



1<i<NSTATE 



(n) (n-1) (n) -6 

IX - X I / X < 0.5 X 10 

i i i 



(n) (n-1) 

where X and X are the i-th components of, 

i i 

respectively, the n-th and the (n-l)-st generated vector, 

(n) (n) . 

The solution method guarantees that X > 0. X. is 

i 1 



assumed to be zero if it is calculated to be less than 
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-27 (n) 

0.5 X 10 .In the case that X is zero, the fraction in 

i 

(n-1) 

(4.1) is set equal to zero if X is zero or one if not. 

i 

If convergence has not been attained in five hundred 
iterations of the Gauss-Seidel method, a message is printed 
and the problem is aborted. The convergence criterion used 
here is, ' admittely , very stringent. However, it permits the 
luxury of not normalizing until convergence has been 
attained. All variables and calculations are single 
precision . 

The (nonop tional) output for all three programs 
consists of the input parameters, the total number of 
states, the number of nonzero terms in the balance 
equations, the number of iterations for convergence of the 
Gauss-Seidel method, and for each processor: the idle 

probability, the probability that it is busy with each type 
of job, the average occupancy, and the average length of 
waiting line. Optional output includes the state vectors, 
the balance equations and up to one thousand steady-state 
probabilities. 

Having considered the similarities in the three 
programs, a discussion of the differences is in order. 



1.3. Pi f f ere nc es in the P ro gram s 

For the tapes program the number, MP, of CPU's (i. 
e., the number of servers at the CPU) is an input parameter. 
This number is set equal to one if a nonpositive number is 
provided through the read statement. Note that the CPU can 
be made an irfinite server queue by setting the number of 
servers equal to any number greater than or equal to N1 + 

N2. 
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Ihe seguencing of the states is according to a 
lexicographic ordering of the ISTATEml vector 
representations for the FCFS and PS models and according to 
a lexicographic ordering of a slightly modified version of 
the ISTAI£m2 vector representations for the tapes model. 

The modification used in the tapes model would list the jobs 
at PPl (necessarily all twos) at the right end rather than 
the left. end of the left subvector. 

In addition to the nonoptional output indicated in 
1.2, the tapes program prints the average number of each 
type jot at each processor, the average number of CPU's busy 
with each type of job, the average number of CPU's which are 
idle, the distributions from which many of these averages 
are calculated (e.g., the distribution of the number of idle 
CPU's), and throughput for each processor. The throughput 
of a processor is the average number of jobs completing 
service at that processor per unit time. It may be 
calculated for a single server FCFS processor, for example, 
by multiplying the processing rate for each type job by the 
proportion of time the server is busy with that type of job 
and accumulating over all job types. 



2. NUMEBICAL RESULTS 



Numerical results from utilization of the three programs 
discussed in the first section of this chapter are given in 
this section. The results given in the first subsection 
show how the three models can to ways in which some of these 
models might be approximated by models be compared for a 
particular example. The second subsection is devoted 
satisfying the conditions of local balance. The chapter is 
concluded with some indications of running times and core 
requirements for relatively large jobs. 
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2.1. Use of the P rogram s to E xamine C ha nges in CPU 
Configura ti on 



Easkett, Chandy, Muntz and Palacios £6] considered 

the network depicted in Figure 4.1. The queuing discipline 

at the CPU is PS and each peripheral is a single server FCFS 

queue. The rates for the exponential service distributions 

are given below the box representing the corresponding 

processor (for example, r is the service rate of type-one 

1 5 

jobs at the CPU) . Only the nonzero branching probabilities 

are given in Figure 4.1. Since each peripheral sees only 
one type of job, the system satisfies the conditions of 
local balance. Thus, Baskett, et. al. were able to use a 

I 

product-form solution to determine the utilization of each 
processor as the number of type-one jobs increased and the 
number of type-two jobs remained fixed at one. The idleness 
probabilities (one minus the utilization) for the cases 
reported by Easkett, et. al. are given in the first column 
of Table 4.1. 

Note that this example is one which the PS model is 
designed to handle. The PS program was used to verify the 
results reported in [6] and to provide the average 
occupancies and throughputs as reported (for the same 
example) in the first column of Tables 4.2 and 4.3 
respectively . 

The FCFS program was used to give the second column 
in these three tables. A comparison of the first two 
columns of each table reveals the effect of changing the PS 
CPU to a FCFS CPU. Note that the differences become mere 
pronounced as the number of jobs increases. 
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1 



I 




Figure 4.1--Network of Baskett, et. al. Example 
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TABLE 4.1 

ICLENESS PH03ABILITY FOR BASKET!, et. al. MODEL 



gueuxng 

discipline 


PS 


FCFS 


FCFS 


FCFS 


ZQFS. 


.£CFS. 


FCFS 


FCFS 


FCFS 


Number CPL 


t* s 


1 


1 


2 


3 


4 


5 


6 


7__ 


8 


Processor 


N 1 




















CPU 


1 


.322 


. 320 


.634 


.765 


.817 


.854 


.878 


.895 


. 909 


CPU 


2 


.280 


. 271 


. 598 


.731 


.799 


. 839 


.866 


. 885 


. 899 


1 CPU 


3 


. 256 


. 238 


.577 


.717 


.788 


.830 


.359 


, 879 


, 894 


1 CPU 1 


4 


.241 


.214 


.564 


.709 


.782 


.825 


. 854 


. 875 


.891 


CPU 


C 


. 231 


. 197 


.556 


.704 


.773 


.822 


.857 


. 873 


. 889 


CPU j 


6 


. 225| 


. 184 


.551 


. 700 


.775 


.820 


.850! 


.872 


.888 


CPU 


7 


.221 


. 175 


.548 


.698 


. 774, 


.819 


.849 


. 371 


.887 


PP1 ' 


1 


. 629 


.6 05 


. 600 


.600 


. 600 


.600 


.600 


. 600 


.600 


PP1 


2 


. 648 


.6 10 


.600 


.600 


,60 0 


,600 


.600 


. 600 


. 600 


PP1 




.66 1 


.615 


. 600! 


.600 


.600 


,600 


.600 


. 600 


. 600 


PP1 


4 


. 670 


.620] 


.601 


.600 


.600 


.600 


.600 


. 600 


. 600 


PP1 


5 


. 676 


. 6 24 


.60 1 1 


.600 


,600] 


,6001 


.600 


.600 


. 600 


?P1 


6 


.67 9 


.6 28 


.601 


.600 


.600 


.600 


.600 


.600 


.600 


PP1 


7 


.68 2 


.631 


.601 


.600 


.600, 


.600 


.600 


.600 


.600 


PP2 


1 


. 616 


.726 


.536 


.5 86 


. 586 


.586 


.5 86 


.586 


.586 


PP2 1 


2 


.354 


. 546 


.357 


.352 


.352 


.352 


,352 


. 352 


.352 


PP2 


•3 


. 25 7 


.4 17 


.2241 


.218 


.218 


.218 


.213 


.218 1 


. 218 


PP2 


4 


. 169 j 


.320 


. 143 


. 138 


.138 


. 138 


.138 


. 138 


. 138 


PP2 


c 


. 1 12 


.2461 


. 093 


.089 


. 089 


. 089 


.089] 


. 089 


.089 


PP2 


6 


.074 


.188 


. 06 1 


.058 


. 058 


. 058 


.058 


. 058 


.053 


PP2 


7 


.049 


. 143 


. 040 


.038 


.038 


.038 


,038 


. 038i 


. 038 


PP3 


1 


.744 


.817 


. 724 


.724 


.724: 


.724 


.724 


.724 


.724 


PP3 


2 


. 596 


.6 97 


. 57 1 


.568 


. 568 


. 568 


. 56 3 


• 568 


.56 8 


PP3 




.505 


.611] 


. 483 


.479 


.479 


.479 


.479 


. 479 I 


. 479 


PP3 


4 


. 446 


. 547 


.429 


. 426 


. 42 5 


, 425 


.425| 


. 42 5 I 


. 42 5 


PP3 


c 


. 408 


.4 97 


.395 


.393 


.393 


. 393] 


.393] 


. 393 , 


. 39 3 


PP3 


6 


.3831 


.4 59 


. 374 


.372 


. 372 


. 372 


.372 


. 372 I 


. 372 


PP3 


7 


, 366 


. 429 


.3601 


.359 


.359 


.359 


.359 


.359 


.35 9 


PP4 


1 


.335 


. 883 


. 322 


. 822 


. 822 


, 822 


.822 


. 822! 


.822 


PP4 


2 


.740, 


. 805 


,724 


. 722 


.722 


.722 


.722 


, 722, 


.722 


PP4 




. 682 


.750 


.667 


.665 


.665, 


.665 


.665 


. 665 


.66 5 


PP4 


4 


,644 


.709 


.633 


.631 


. 631 


1.631 


.631 


.631, 


. 63 1 


PP4 


5 


.6 19 


.677 


,611 


.610 


. 6 1 0 i 


1.610, 


.610 


.610 


.610 


PP4 


6 


.603 


.652 


.597 


.596 


.596 


j .596 1 .P96 


. 596 


. 596 


PP4 


7 


.593 


L. 633 


. 58 8, 


.588 1 . 588 


j . 58 8 1 . 588 


.588 


. 588 
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TABLE 4.2 

AVEBAGE OCCUPANCIES FOB EASKETT, et . al. MODEL 



C?l3 queuing 
discipline 


PS 


FCFS 


[Pci’S 


FCFS 


FCFS 


FCFS 


FCFS 


FCFS 


,?^S„ 


Number CPU's 


1 


1 


2 


3 _^ 


4_. 


5_. 


6 


2_. 


8 


Processor 

CPU 


1 


.824 


1 . 03 


. 732 ! 


.732 


. 732 ' 


.732 


.732 


. 732 


.732 


CPU 


2 


.999 


1.41 


.8191 


.806 


.806 


.806 


.806 


. 806 1 


. 806 


CPU 


3 


1.13 


1. 74 


. 877 


.849 


. 848 


.848 


.848 


. 848 


.848 


CPU ^ 




1.23 


2.04 


.916 


.876 


.874 


.874 


.874 


. 874 


. 874 1 


CPU 


5 


1. 30 


2.30 


. 94 1 


.8 92 


. 889 


.889 


.889 


.889 


, 889 


CPU 


6 


1.36 


2.52) 


. 959 


.903 


.899) 


.899 


.899 


. 899 


.899 


CPU 


7 


1.39 


2.72 


. 971 


. 909 


. 906 


.905 


.905 


. 905 


. 905 


PP1 


1 


.37 1 


. 395 


. 400 


.400 


. 400 


.400 


.400 


. 400 


. 400 


PP1 


2 


.3521 


.3 90 


.400 


.400 


.400 


.400 


.400 


.400 


.400 


PP1 1 


3 


.339 


.385 


. 400 


.400 


.400 


.4001 


.400 


. 400 


. 400 


PP1 


4 


.3301 


.3 80 


.399) 


.400 


.400 


.400 


.4001 


. 400 1 


. 400 


PP1 


5 


.324 


.376 


.399 


.400 


.400 


.400 


.400 


. 400 1 


, 400 


PP1 


6 


. 321 


.3 72 


.399 


.400 


.400 


.400 


.400 


.400 


. 400 


, PP1 


7 


.318 


.369 


.399 


.400 


.400 


.400 


.400 


. 400 


. 400 


PP2 


1 


. 384< 


. 274 


.414 


.414 


.414 


.4 14 


.414 


. 414 


.414 


PP2 


2 


. 839 


.6 02 


.910 


.916 


.916 


. 916 


.916) 


.916 1 


.916 


PP2 I 


3 


1.37 


.980( 


1.48 


1 . 50 


1.50 


1 . 50 


1.501 


1 .50 


1.50 


PP2 ' 


4 


1.97 


1.41 


2. 1 3 


2.151 


2. 15 


2. 15 


2. 15 


2. 15 


2.15 


PP2 


5 


2.63 


1. 89 


2.83 


2. 87 


2.87 


2. 87 


2.87 


2.87 


2.871 


PP2 


6 


3.36 


2.42 


3.60 


3.651 


3.65 


3.65 


3.65 


3.65 


3.65 


PP2 i 


7 


4. 15 


3. 00 


4.42 


4.47 


4.47 


4.47 


4.47 


4.47 


4.47 


PP3 


1 


. 256 


. 183 


.276 


.276 


.276 


. 276 


.276 


. 276 


.276 


PP3 


2 


.508 


. 373 


.547 


.551 


.551 


.551, 


.551 


.551 1 


.551 


PP3 


3 


.747 


.567 


.301 


. 308 


.809 


. 809 


. 809 1 


. 809 


. 809 


PP3 


4 


. 968 


.758 


1 . 03 


1.04 


1.04 


1.04) 


1 .04 


1.04) 


1.04 


PP3 


5 


1.16, 


.9 44 


1.23 


1 . 24 


1.24, 


1.24 


1.24 


1.24 


1.24 


PP3 


6 


1. 34 


1 . 12i 


1.39 


1.41 


1.41i 


1 .4 1 1 


1.41 


1.41, 


1.41 


PP3 1 


7 , 


1.48 


1.29 


1.53 


1. 54 


1.54 


1 . 54 


1, 54 


1 . 54 


1.54 


PP4 


1 


. 165 


. 1 17 


. 178 


.178 


.178 


,178 


.178 


. 178 


. 178 


PP4 


2 


.303 


. 226 


.325 


.327, 


.327 


.327 


.327i 


. 327 


. 327 


PP4 


3 


.415] 


.325 


. 44 1 


.4 45 


.445| 


.445 


.445| 


. 445 


. 445 


PP4 


4 


. 504| 


.4 14 


.529 


.533 


.534 


.534 


.534 


. 534 


. 534 


PP4 


5 


.57 2 


. 493 


.595 


.599 


. 599 


.599 


.599 


. 599 


. 599 


PP4 


6 


.624 


.562 


.64 2 


. 645 


.646] 


.646 


.646 


.646 


. 646 


PP4 


7 


.661 


.621 


.ip76j 


.678 


.1.679. 


.679J 


.679 


.679 


.679 
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TABLE 4.3 

IHfiOUGHPUT ?OR BASKETT, et. al. MODEL 



'■~CPTJ queuing 
discipline 


PS 


FC?S 


FCFS 


.FCFS. 


.ICF^ 


-£CFS‘ 


FCFS 


FCFS 


FCFS 


Number CPU's 


1 


1 


2 


3 


4 


5 


6 


. 1 , 


8_. 


Processor 


Ji- 




















CPU 


1 


1.65 


1. 38‘ 


1 . 73 


1.78 


1.78 


1 .78 


1 .78 


1 .78 


1.73 


CPU 


2 1 


2.26 


1.88 


2. 44 


2.45 


2.45 


2.45 


2.45 


2.45 


2.45 


CPU 


3 


2.63 


2. 24 


2.82 


2.83 


2. 83 


2. 83 


2. 83 


2.83 


2.83 


CPU 


4 


2.87 


2. 51 


3.05j 


3. 06 


3.06 


3.06 


3.06 


3.06 , 


3.06 


CPU 


c 


3.02 


2.72 


3.19 


3.20 


3.20 


3.20 


3.20 


3.20 1 


3.20 


CPU 


6 


3. 13 


2. 88i 


3. 28 


3.29 


3.29 


3.29 


3.29 


3.29 


3.29 


CPU 


7 


3.19 


3.00 


3.34 


3.35 


3.35 


3.35 


3.35 


3.35 


3.35 


PP1 


1 ' 


.556 


.593 


.600 


. 600 


. 600 


.600 


.600 


. 600 


.600 


PP1 I 


z 


. 527 


. 585 


.60 0 


. 600 


.600 


.600 


.600 


. 600 


. 600 


PP1 




.506 


. 577 


.599 


.600 


.600 


.600 


.600 


. 600 


.600 


PPI 


4 , 


.495 


. 570 


. 599 


. 600 


. 600 


.600 


.600 


. 600 


.600 


PP1 


C 


.487j 


.564 


.599 


.600 


.600 


.600 


.600 


. 600 


.600 


PPI 1 


6 


. 481 i 


.558 


. 599, 


.600 


. 600 


.600 


.600 


. 6O0 


. 600 


PPI 


7 


. 477 


.553 


.599 


.600 


.60 0 


.600 


.600 


. 600 


.600 


PP2 


1 


. 384 


. 274 


.4 14 


.414 


.414 


. 4 14 


.4 14 


.414 


.414 


PP2 


2 


.606 


. 454 


. 643 


.647 


.647 


.647 


.647 


. 647 


. 647 


PP2 




.743 


.5 83 


. 7761 


.782 


.782 


.782 


.782 


.782 


.782 


PP2 1 


4 


.831 


.680 


. 857] 


.861 


.862 


. 862 


.862 


. 862 


. 862 


PP2 


5 


.888 


. 754 


.907] 


.911 


.91 1 


. 91 1 


.911 


. 911 


.911 


PP2 i 


6 j 


. 926 


.812 


. 9391 


.942 


.942 


.942 


.942 


. 942 


. 942 


PP2 


7 


.951 


.857 


. 960 


.962 


. 962 


. 962 


.96 2 


. 96 2 


.96 2 


PP3 


1 


. 38 4 


. 274 


.4 14 


.4 14 


.414 


. 4 14 


.4 14 


. 414 


. 414 


PP3 


2 


.6 06 


. 454 


. 643 


.648 


. 648 


. 648 


.648 


. 648 


. 6481 


PP3 




.743 


.5 83 


.776 


.782 


.782 


.782, 


.7821 


. 782 


.782 


PP3 


4 


.831 


. 680! 


.8571 


. 862 


.862 


. 862] 


.36 2 


. 862 


. 862 


PP3 


5 


.388 


. 754 


. 907 


.91 1 


.91 1 


.911 


.911 


.911 


.911 


PP3 


6 , 


. 926 


.812 


. 939 


.942 


. 942 


. 942 


. 942 


. 942 


. 942 


PP3 


7 


.951 


. 857 


.960 


.962 


. 962 


. 962 


.96 2 


. 96 2 


.962 


PP4 ' 


1 


. 329 


. 235 


.355 


.355 


.35 5 


.3 55 


.355 


. 355 


. 355 


PP4 


2 


. 520i 


. 389 


.551 


.555 


.555 


.555 


.555 


. 555 


.555 


PP4 




.637 


.5001 


.665 


.670 


.670i 


.670 


.670 


.670 


. 670 


1 PP4 


4 


.712 


.583 


.734 


.739 


.739 


.739 


.739 


. 739 


. 739 


PP4 


5 


. 76 1 


.6 46 


.778 


.781 


.781 


.781] 


.78 1 


.781 


.78 1 


PP4 


6 


.793 


.696 


.805 


. 807 


. 808] 


. 808 


.303 


. 808 


. 803 


PP4 


7 J 


.81 5j 


. 734 


. 823j 


. 825 


. 82^ 


. 825 


. 825 


. 825J 


. 825 
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since only type-two jobs are allowed to route to 
FP1, and since only one type-two job is considered, PP 1 is 
an IS queue and the tapes program can be used to determine 
the effect cf increasing the number of (FCFS) CPU's. This 
was done and the results are also reported in the three 
tahles. 

Consider in more detail, now, the effect of 
replacing a PS CPU with a FCFS CPU in a model such as that 
depicted in Figure 4.1. Since the operations are exactly 
the same in the two models except when the system is in a 
state in which two or more jobs occupy the CPU, only these 
states will be considered. In the PS CPU each job receives 
service immediately upcn arrival. However, the rate of this 
service is degraded according to the number of jobs which 
occupy the CPU. In the FCFS CPU there is no degradation of 
service. This fact supports a tendency toward shorter 
occupancy times (waiting time plus service time) for 
individual jobs and, therefore, greater throughput for the 
FCFS CPU than for the PS CPU. However, this tendency is 
counterbalanced by the fact that jobs must wait in turn for 
service in the FCFS CPU, but receive service immediately in 
the PS CPU. This latter effect becomes dominant if the 
service rates for one type of job is larger at rhe CPU and 
the PP's than that for the other type of job is at the CPU. 

Such a case is exemplified in the model by Baskett, 
et. al. [6] discussed above and depicted in Figure 4.1. In 
this example the service rates for the type-one jobs are 
larger at most PP's and much larger at the CPU than the 
service rate for type-two jobs at the CPU. In the PS model 
a type-one job arriving at the CPU after a type-two job 
might complete service at the CPU, receive service at cne of 
the PP's and return to the CPU several times before the 
type-two job leaves the CPU. In the FCFS model the type-one 
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jot would be forced to wait for service until the type-two 
job left the CPU. As emphasized in the preceding paragraph, 
the effects of degraded service in the PS model are 
counterbalanced by the effects of forced waiting in the FC?S 
model. However, as can be seen in Table 4.3, the net 
result in the cases for which data are given is a decrease 
in throughput at the CPU when the PS CPU is replaced by a 
fCPS CPU.' This results from a decrease in the throughput 
for type-cne jobs and an increase for type-two jobs, as can 
be seen by examining the throughput at the PP's. From 
Tables 4.1 and 4.2 it is also seen that idleness 
probabilities decrease at the CPU and PP1 and increase at 
PE2, PP3 and PP4, and average occupancies increase at the 
CPU and PP1 and decrease at PP2, PP3 and PP4. 



The difference in results between the PS model and 
the FCFS model increase as the number, N1, of type-cne jobs 
increases, while the number of type-two jobs remains 
constatn at one. The reason for this is that the 
degradation of service in the PS CPU becomes less serious 
for the type-one jobs and more serious for the type-two job 
as the number of jobs at the CPU increases. For example, 
with one job of each type at the CPU, the rate of departure 
of jobs from the CPU is 9/2 for type-one jobs and 1/2 for 
type-two jobs. Increasing the number of type-one jobs to 
four changes these rates to 36/5 and 1/5. 



Finally, consider the effect of increasing the 
number of (FCFS) servers at the CPU. Note that in those 
cases in which the number of type-one jobs is smaller than 
the number of CPU's, the CPU has an IS queuing discipline. 
From this point on, increasing the number of CPU's has no 
effect on the operation of the network. The reason that 
idleness probabilites in Table 4.1 continue to change past 
this point for the CPU is that they were found by dividing 
the average number of idle CPU's by the total number of 
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CPU'S. If each idle CPU has an equal chance of being chosen 
by a newly arriving job, this idleness probability is the 
proportion of time chat each CPU is found idle. Since these 
IS CPU cases provide immediate, full service (as opposed to 
the delayed service of the PCFS CPU and the degraded service 
of the PS CPU) throughput is a maximum for all processors. 
Idleness probability is small and average occupancy large 
(relative to all other cases reported for the same job mix) 
at the PP's. 

As the number of servers at the CPU increases from 
one to N 1 + 1, throughput at all processors increases. The 
reason for this is obvious at the CPU and nearly so at the 
PP's: More servers at the CPU means more rapid return of 

jobs to the PP's, resulting in a greater proportion of time 
spent serving jobs. Similarly, the idleness probabilities 
at the PP's decrease and the average occupancies increase. 
The result is a corresponding decrease in average occupancy 
and a resulting increase in idleness probability at the CPU. 
Note that in each case there is a point beyond which 
increasing the number of CPU's (that is, the number of 
servers at the CPU) does not significantly improve the 
system. For example, with N1 = 7 there are no changes in 
the measures of effectiveness (down to the third significant 
digit) fcr the PP's, and little change in those for the CPU, 
as the number of servers at the CPU is increased beyond 
three. The reason for this phenomena is that the PP's have 
become the bottleneck at this point. Judging from the 
results given in the three tables, increasing the number of 
CPU's beyond two would probably not be justified if the job 
mixes given are representative of those which would be seen 
by the computer modelled. 
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2.2. App roxim ation by L oc al B ala nce Networ ks 



In this subsection consideration is given to ways in 
which a queuing network which does not satisfy the local- 
balance conditions might be approximated by one which does. 
The goal .is to find an approximation whose steady-state 
properties are vary nearly the same as those of the original 
network. If such an approximation can be found, much time 
and effort cculd be saved by taking advantage of the 
product-fcrm solution to the approximate model. 

Cf particular interest is the FCFS model discussed 
in section 1. This model does not satisfy the local-balance 
conditions because it contains at least one FCFS queue at 
which jobs cf different types receive service at different 
rates. As can be seen from subsection 5.3 of Chapter I, 
the work cf Haskett, Chandy, Muntz and Palacios [6] 
indicates that local balance would prevail if the queuing 
discipline at these queues were PS, IS or LCFSPR rather than 
FCFS, or if the processing rates were the same for the two 
jcb types at each FCFS processor. These observations will 
form the basis for most of the approximations discussed 
here. 



Reviewing the example of Baskett, et . al. expanded 
upon in subsection 2< 1 , it is seen that replacing the FCFS 
CPU by either an IS or a PS CPU could lead to significant 
differences in the measures of system performance (see 
Tables 4.1, 4.2 and 4.3). The reason for this is that the 
rapidly- mcvi tg type-one jobs must at times wait at the FCFS 
CPU for the slew-moving type-two jobs to complete service. 

In the IS CPU there is no interference between the two types 
of jobs. In the PS CPU there is interference in the form of 
degraded service, but the extreme difference in service 
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rates makes this interference considerably less severe than 
that in the PCFS CPU. 

If the FCFS CPU were replaced with a LCFSPR CPU in 
the same model, seme interference of the type encountered in 
the FCFS CPU would- occur. The completion of service of any 
type-one job at the (LCFSPR) CPU, whether the type-one job 
was enqueued or in service when the type-two job arrived, 
would necessarily wait until the type-two job had completed 
its service. However, the high service rate of the type-one 
jots at the CPU as compared to their rates at the PP's makes 
the probability that the type-two job will catch one or more 
type-one jobs at the CPli when it arrives rather small. On 
the other hand, the chances are good that the type-two job 
will be interrupted (perhaps, several times) while at the 
CPU. The anticipated result is an increase in idleness 
probability and a decrease in average occupancy and 
throughput at PP 1 and the reverse of this at PP2, PP3 and 
PP4. These facts are easily seen to be true since a careful 
reading of [6] reveals that the idleness probabilities, 
average occupancies and throughputs for this example are the 
same whether the CPU has a PS queuing discipline or a LCFSPR 
queuing discipline. That is, "PS" in Tables 4.1, 4.2 and 
4.3 could be replaced by "LCFSPR." 

So it is seen that simple replacement of a FCFS 
processor by_a PS, IS or LCFSPR processor with the same 
service distributions may result in significantly different 
values for the measures of system performance. It may be 
possible to arrive at better approximations by also changing 
the service distributions (for example, by adjusting the 
service rates) . However, the types of changes to make are 
not obvious if the queuing discipline is also changed. 

In [6] Basketz, et . al. investigated an 
approximation to the above model in which the two classes of 
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jots are replaced by a single "equivalent'* class of jots. 
For each value of N1 they retained the given service rates 
at the Pi's and replaced the service rates at the CPU ty 

(4.2) = (g^ . g^) / * g^/r^^) 



and the branching probabilities by 



(4.3) 



(gp +gp )/(g + q) 

1 1j 2 2j 1 2 



where g is the rate at which class i jobs leave the CPU. 
i 

These rates are determined from the solution of the original 

mcdel. These choices for the rate at the CPU and the 
branching probabilities for the "equivalent" class of jobs 
are reasonable since they are the values for the parameters 
which would actually be measured. 



Since the resulting network is a central-server 

model with a single job type and single- server exponential 

queues, the work of Gaver [38] applies (see subsection 4.2 

of Chapter I) . That is, the joint steady-state probability 

of finding n jobs at the i-th PP (where n ♦ n + n + n 
i 12 3 4 

is less than or equal to the total number of jobs in the 



system) is proportional to the product over i of R raised 

to the n power where E = r p /r . With the choices of r 
i i 5 i i 

and p given in (4.2) and (4.3), respectively, R is the 
i 1 

ratio of the utilization of the i-th PP to the utilization 



5 



of the CPU. This same result generalizes to the less 
specific FCFS model in which both job types are allowed to 
visit each processor. 
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The fact that the exact results from each case (that 
is, each job mix) must be used to derive the parameters for 
the approximate model makes this method of dubious value. 

The exact model must be solved (or measurements taken on the 
system represented) in order for the parameters of the 
approximate model to be determined. If the queuing 
disciplines, service rates, branching probabilities or job 
mix changes, these parameters must be recalculated from new 
measurements. They cannot be determined from the parameters 
of the exact model directly. 

Besides this Baskett, et . al. showed that 
significant differences can exist between the results given 
by the exact model and those given by the approximate model. 
The author has had similar experience in applying this 
method to a variety of two-PP FCFS models. 

An area which is still open for investigation is 
approximation of, for example, the FCFS model by a model in 
which the branching probabilities are the same but the 
service rates for jobs of different classes are replaced by 
a single service rate at each FCFS queue. The work of 
Baskett, et . al. in [6] could then be used to determine the 
measures of system performance in the approximate model. 
Perhaps, with the aid of the FCFS program an appropriate 
choice for the composite service rates could be determined. 

2.3. Sp ecificati o n s fo r Large Jobs 

In this final subsection attention is focused on the 
running time and core requirements of some large jobs. With 
common values for system parameters (Nl, N2, tne service 
rates and the branching probabilities) the FCFS model 
results in more states than either of the other two models. 
The PS queuing discipline at the CPU in the PS model and the 
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inultiserver at the CPU and inficite server at PP1 in the 
tapes model reduce the size of the state space from that 

experienced in the FCPS model. In addition, despite the 

fact that more nonzero entries are reauired per row in the 
rate matrix (COEF) for both the PS model and the tapes 

model, the reduced state spaces in these models result in a 

reduction in the total number of nonzeros in large cases. 

As a result the primary concern here is the requirements of 
the FCFS program. 

A FCFS model with four jobs of each type (N1 = N2 = 
4) and two PP's (NEP = 2) resulted in 3150 states and 10080 
ncnzeros in the rate matrix. Dimensioning the state 
probability array (P) and the three arrays used for storage 
of the balance equations (NCON, INDEX and COEF) accordingly, 
the program required 164K bytes of storage on the IBM 360-67 
computer at the Naval Postgraduate School in Monterey, 
California. The service rates used in this model were 0.6, 
1.2 and 0.9 for type-one jobs and 0.8, 1.0 and 0.5 for 
type-two jobs at the CPU, PP1 and PP2, respectively. The 
branching probabilities were 0.55 and 0.45 for type-one jobs 
and 0.30 and 0.70 for type-two jobs. Convergence was 
attained (using the rather stringent criterion described in 
section 1) in 187 Gauss-Seidel iterations. The total 
execution time for the program was 212.13 seconds. 

A second case using the FCFS model with N1 = 2, N2 = 
5 and NPP = 4 resulted in 6930 states and 35230 nonzeros in 
the rate matrix. Dimensioning P, NCON, INDEX and COEF 
accordingly, the problem required 396K bytes of storage. 

The service rates used were 10.0 for type-one jobs and 1.0 
for type-two jobs at the CPU and the reverse of these at 
each peripheral processor. Branching probabilities were all 
.25 regardless of job type or PP. Convergence was attained 
in 206 iterations. The program required 527.05 seconds of 
executioD time. 
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Ihe PS model was also run using the parameters of 
this second example. The result was 5754 states and 30464 
nonzeros. The dimensions of the FCFS model (6930 and 35280) 
were used in this model and, because of differences in the 
pregrams themselves, this resulted in a requirement for 398K 
bytes of storage. This figure could have been reduced by 
dimensioning P, NCON, INDEX and COEF according to actual 
requirements. Convergence was attained in just 32 
iterations. The program required 98.63 seconds of execution 
time. The measures of system performance reported in Table 
4.4 for these two models aqain point out the extent of 
possible error when a PS CPU is assumed to be a FCFS CPU, or 
visa versa. In this particular comparison type-one jobs are 
"PF bound" and type-two jobs are "CPU bound." 

although the number of iterations required for 
convergence of the Gaass-Seidel technique for solution of 
the balance equations generally increases with the number of 
states, wide variation exists in the number of iterations 
required for a problem with a given number of states. 

Indeed, the author has examples of cases with only 30 states 
requiring more iterations than the 5754-state case reported 
in the preceding paragraph. This should not be too 
surprising. As pointed out in Chapter III, the number of 
iterations required for convergence depends upon the 
magnitude of the nondominaat eigenvalues of the rate matrix. 
The eigenvalues of any matrix depends upon each and every 
entry in the matrix. Changing a single value in the matrix 
will, in general, change all of the eigenvalues. Thus, 
little hope is seen for being able to predict the number of 
iterariens required, or the running time, based upon the 
system parameters. The running time can be reduced by 
relaxing the convergence criterion. Since the real items of 
interest are the measures of system performance, such as 
idleness probabilities and average occupancies, changes in 
these quantities could be used as a basis for determining 
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TABLE 4.4 

CGMPABISON OF FCFS AND PS PROGRAMS IN A 
CASE IN WHICH NPP =4, N1 = 2 AND N2 = 5 



FCF S Proq ra m 



Number of states 6930 

Number cf nonzeros 
in rate matrix 35280 

CPU idleness probability .000197 

PP idleness probability .890 

CPU average occupancy 6.438 

PP average occupancy .141 

CPU throughput 1.308 

PP throughput .327 



PS P r og ram 
5754 

30464 
.000920 
.696 
5.273 
. 432 
2.014 
. 504 



Note that performance at the four PP's is the same. The 
figures reported here are for each PP. 
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convergence. Unfortunately, any savings which might be 
realized by decreasing the number of iterations in this way 
would be diminished by the time required to calculate these 
measures at each iteration. In many cases the result may be 
a net increase in running time. 

Comparisons of the cere requirements and running 
time with these required by other possible solution methods 
are difficult at best. None of the examples presented here 
were run cn a dedicated machine. Because of the effects of 
other jots in the system, rerunning these same jobs would 
likely result in different execution times. Although the 
author was “core conscious" in programming the models, there 
are probably still ways in which core usage could be 
reduced. 



In general, the following statement can be made. If 
a product-form solution exists, running time is bound to be 
shorter fer a program designed to take advantage of it. The 
reason is that no iterations are required, and it is net 
necessary to generate and store the balance equations. Even 
in such cases, however, some of the concepts and techniques 
developed in Chapter II may be helpful in generating all 
states without duplications. 
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7. 



IHE SOFT WARE^a ONI TOR PROBLEM 



Software monitors, or system jobs, are used in most 
large-scale, multiprogrammed computer systems to provide 
data concerning performance of the system and usage of the 
system by individual user jobs. In this chapter some 
possible sources of bias in the data provided by such 
monitors are investigated. Modelling the computer system as 
a closed central-server system, two types of software 
monitors are examined. In each case a method is presented 
which will predict the bias in the monitor. 

Consider a situation in which a central-server model is 
developed for an existing computer system, and a software 
monitor is used to collect system-performance data in order 
to validate the model. The results of this chapter could be 
used to explain differences between model results and 
monitor results. The guesticn of correcting monitored data 
for bias is not addressed here. This remains an open 
question which has apparently not been considered in the 
open literature. 



1. THE FEOBLEM 



One of the major problems in modelling real systems is 
monitoring the system so as to collect the data necessary to 
estimate the parameters used in the model. The same problem 
arises in validation of resulting models where system 
performance must be monitored for comparison with model 
predictions. 
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For ccmputer systems two types of monitors are generally 
available. Hardware monitors are external devices which, 
when attached to various processors, determine the activity 
at the processors by measuring passage of electrical current 
and changes in electrical potential. Software monitors are 
internal devices, or system programs, which use computer 
processing capabilities (CPU time and memory) to collect 
data. -Individual program accounting data (used as a basis 
for charging system users) are one class of data collected 
using software monitors. 

Software monitors, in using computer resources, somewhat 
degrade the resources available to users of the system. For 
example, system programs require storage (often in core) , 
degrading the amount of storage available for very large 
user programs. They also require processing time and 
therefore increase "turnaround time" for users programs. 
These are usually not serious problems, and software 
monitors are generally much less expensive and more flexible 
than hardware monitors. This, plus the fact that it is 
extremely difficult to differentiate between individual 
programs with a hardware monitor, makes the software 
monitors the most likely choice for data collection, whether 
for accounting data, model-parameter data or model- 
validation data. 

In this chapter models of two types of software monitors 
are discussed. A possible source of bias in the data 
collected by such monitors is analyzed. Throughout the 
chapter the computer system under consideration is modelled 
as a closed central-server network of queues as pictured in 
Figure 1.1. This model is assumed to be an accurate 
representation of the system, and steady-state probabilities 
(assumed to exist) calculated from the model are therefore 
also assumed to be e.ccurate. The purpose of the software 
monitor in each case is to determine system performance by 
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observing this probability distribution. A model of the 
monitor is used to determine the distribution "seen" by the 
monitor sc that comparison may be made with the "rrue" 
distribution and bias determined. 

In section 3 consideration is given to a monitor which 
briefly occupies the CPU and determines the state of the 
system each time a job leaves the CPU. In section 4 the 
monitor enters the system infrequently, waits for a free 
server at the CPU, then briefly occupies the CPU and 
determines the state of the system. In both sections the 
problem is analyzed for a Jackson-type network (as described 
in section 2) and then extended to more general cases such 
as those discussed in Chapter II. 



2. THE BASIC MODEL 



The software-monitor models discussed in sections 3 and 
4 will be analyzed extensively in terms of the closed 
central-server model depicted in Figure 1.1 with the 
following assumptions; 

(i) Each queue is a single- server , exponential 
queue with constant service rate and infinite 
capacity. 

(ii) Only one type of job circulates in the 
system . 

(iii) Branching probabilities are constant. 

Under these assumptions the queuing network under 
consideration is called a Jackson network (the term is 
borrowed from Melamed, Zeigler and Beutler [80]) in honor of 
J. E. Jackson [56] who first exhibited a product-form 
solution for the sready-state probability distribution of 
such models. Before examining this solution, consider the 
balance equations which the solution must satisfy. 
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The model described here is a continuous-' time, 
finite-state Markov process which is also irreducible. 

Hence, the steady-state probability distribution exists and 
can be determined by solving a set of balance equations 
together with a normality condition. (See Feller [32].) 
Derivation of the balance equations through limiting 
arguments and the Komolgorov differential equations is a 
standard exercise in such cases and is therefore omitted. 
(See section 3 of Chapter I.) Direct development, of the 
balance equations will follow the introduction of some 
notation . 

Let N denote the total number of jobs circulating in the 

system. The M + 1 processors are numbered sc that processor 

i is PPi for i = 1, 2, ..., M and processor M + 1 is the 

CPU. The service rate at processor i is r and the 

i 

branching probability (from the CPU) to PPi is b . By the 

i 

Markov-like assumptions, a state A is completely specified 

by a listing of the number of jobs at each processor. Thus, 

A = (n ,n ,...,n ) where for each i = 1, 2, ..., M+1, n 

1 2 M+ 1 i 

is the number of jobs at processor i when the system is in 

state A. Note that n +n +...+n =N. Also note 

1 2 M + 1 

that every vector (n ,n ,...,n ) of nonnegative integers 

12 M+1 

which sum to N represents a valid state. 

For any state A = (n ,n ,...,n ), let A denote the 

12 M+1 i j 

state with i-th component increased by one and j-th 
component decreased by one; e. g., A = 

i j 

(n ,...,n +1, ,n -1,...,n ) if i < j. Since A is not 

1 i j M+1 ij 
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a valid state if n =0, any statement containing, for 

j 

example, the probability of A will have that probability 

ij 

multiplied by 0(n ) where 

j 



(5.1) 




if n > 0 
if n < 0 



so that a large variety of special cases need not be 
considered to handle such "non-states.” 



For each state A, P (A) will denote the steady-state 
probability that the system is in state A. 

Let A = (n ,n ,...,n ) be any state. To develop the 

12 M+1 

balance equation associated with state A, note that: 

(i) The rate of transition out of state A (given 
the system is in state A) is given by 

M+1 

(5.2) H (A) = 5 ;zJ(n .) r . 

(ii) The rate of transition from state A into 

j,M + 1 

state A (given n >0 and the system is in state 

M + 1 

A ) is r . 

j,M+1 j 

(iii) The rate of transition from state A 

M + 1/ j 

into state A (given n >0 and the system is in 

j 

state A ) is b r 

M+1,j j M+1 

(iv) The states A and A for j = 1/ 2, 

j,M+1 M+1,j 
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M are the only states from which the system 
can transition in one step into state A. 



Thus, the 


balance eguation associated with state A is 


(5.3) 


M 

R (A)P(A) = ja(n ) 2 r P(A . ) 

M+1 3 3rM+1 

3=1 

M 

+ 2 b V 0(n .)P(A ^ .) 

3 il + 1 3 M+ 1 , 3 

j=1 


Using 


Gordon and Newell's [46] formulation of this 



problem, Gaver [38] has determined that the solution is 



given by 




(5.4) 


M 

P(A) = K tT 

>1 ^ 



where K is a constant which insures that the appropriate 

normality condition is met and, for j = 1,2,...,M, g = 

i 

b r /r . Note from (5.4) that 
j H+1 j 



(5.5) 


P (A ) = g .P (A) 

3, M+1 3 



and 



(5 • 6) 


p (A ) = P (A)/g . 

M+1,j 3 
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3. 



THE EVENT-KEYED SOFTWARE UONITOR 



Now consider attempting to monitor system performance by 
means of a software-monitor job, hereafter called the 
observer, which briefly occupies the server at the CPU each 
time one cf the other N jobs completes service at the CPU. 
While at the CPU, the observer counts the number of jobs at 
each of the tS + 1 processors. The time it occupies the 
server is assumed to be exponentially distributed with mean 
e so small that the disturbance to system operation is 
minimal. In   been  used  in  Chapter  II. 
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The  number,  NPP,  of  FP's  is  an  input  parameter  in 
all  three  programs.   (In  previous  chapters  this  number  has 
been  represented  by  M,  but  in  the  programs  it  is  NPP.)   NPP 
is  restricted  to  be  no  smaller  than  two  and  no  larger  than 
nine.   The  lcwer  bound  is  a  natural  restriction  in  the  tapes 
model  (the  type  one  jobs  have  to  route  somewhere  when  they 
complete  service  at  the  CPU)  and  is  necessary  for 
irreducibility  in  the  FCFS  model.   The  upper  bound  is  to 
help  insure  that  the  right  subvector  (see  Chapter  II, 
subsecticn  3.1)  can  be  stored  as  a  single- precision  integer. 
This  bound  can  te  relaxed,  but  not  withcut  a  careful 
rep rogramning  effort. 

The  number,  N1,  of  type-one  jobs  and  the  number,  N2, 
of  type-twc  jobs  are  input  in  all  three  programs.   Both  N1 
and  N2  are  restricted  to  oe    nonnegative  and  their  sum  must 
net  exceed  nine.   The  upper  bound  is  to  insure  that  the  left 
sufcvector  (see  Chapter  II,  subsection  3. 1)  can  be  stored  as 
a  single-precision  integer.   Together  with  the  upper  bound 
on  NPP,  the  upper  bound  for  N1  +  N2  insures  that  the  right 
sutvector  can  also  be  so  stored. 

The  tapes  program  will  reject  any  case  in  which  N1  = 
0  or  N2  =  0.   If  N1  =  0,  the  FCFS  program  and  the  PS  program 
will  decrease  N2  by  cne,  increase  N1  to  one,  replace  the 
service  rates  and  branching  probabilities  for  type-one  jobs 
by  those  for  type-two  jobs,  and  solve  the  problem  as  though 
there  were  actually  two  types  of  jobs.   (The  case  in  which 
N2  =  0  is  treated  similarly.)   Because  of  this  N1  +  N2  is 
restricted  to  be  no  smaller  than  two.   The  reader  is 
cautioned,  however,  that  output  referring  to  job  type  (for 
example,  the  probability  that  PP2  is  busy  with  a  type-one 
jot)  loses  meaning  when  N1  =  0  or  N2  =  0  initially.   Note 
that  any  case  in  which  there  is  only  one  job  type  can  te 
solved  using  a  product-form  solution.   For  the  FCFS  model 
the  reader  is  referred  to  Gaver  [38].   For  the  other  two 


208 


models  see  Easkett,  Chandy,  Muntz  and  Palacios  [6]. 

For  each  job  type  i  =  1,2  and  each  PPj,  j  = 
1,2,..., NPP,  the  branching  probability,  ALFA(i,j),  cf  a  type 
i  job  to  PPj  is  also  an  input  parameter.   If,  for  some  j  = 

-6 

1,  2,  ...,   KNE,  ALFA(i,j)  <  0.5  x  10    for  both  i  =  1  and  i 

=  2,  then  PP  :  is  eliminated  from  consideration,  the  PP's  are 

renumbered  (if  j  *  NPP) ,  and  NPP  is  decreased  by  one.   NPP 

must  satisfy  the  restrictions  indicated  above  after  all  such 

-6 
trimming  has  taken  place.   If  |ALFA(i,j)j  <  0.5  x  10    for 

some  i  and  j,  then  ALFA(i,j)  is  set  equal  to  zero.  If  any 
ALFA(i,j)  is  negative,  an  error  message  is  printed  and  the 
problem  is  net  solved. 

For  each  job  type  i  =  1,2  and  each  processor  j  =  1, 

2,  ...,  NPP  +  1  (processor  NPP  ♦  1  is  the  CPU),  the  service 

rate,  RATZ(i,j),  cf  type  i  jobs  at  processor  j  is  an  input 

-6 
parameter.   If  any  such  rate  is  smaller  than  0.5  x  10   ,  an 

error  message  is  printed  and  the  problem  is  not  solved. 

Note  that  tests  are  performed  in  the  order  that  the 
restrictions  are  presented  here.   Hence,  a  negative 
branching  probability  or  service  rate  will  net  cause 
abortion  cf  the  problem  if  the  affected  PP  is  eliminated,  or 
if  it  is  for  type-one  jobs  and  N1  =  0. 

Each  program  is  divided  into  a  main  routine  and 
seven  subroutines: 

MAIN:   Controls  flow  of  the  program;  contains  read 
statements  for  input;  develops  balance  equations. 
CKOUT:   Checks  input  parameters  for  adherence  to  the 

above  restrictions. 
INIT:   Stores  the  right  and  left  subvectors  and 
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associated  pointer  vectors. 

LOOKUP:   Calculates  the  state  number  from  the  vector 

representation. 

PIG:   Calculates  the  number  of  combinations  of  n   + 

1 

n   things  taken  n   at  a  time. 
2  1 

GSSOL:   Selves  the  balance  equations. 

PENTP:   Prints  the  steady-state  probability 

distributions  (no  more  than  one  thousand  states) . 

ACCUM:   Calculates  and  prints  the  measures  of  system 

performance. 

Subroutines  PIG,  GSSOL  and  PRNTP  are  the  same  for  all  three 

programs.   CKCUT  is  the  same  in  the  FCFS  and  PS  programs. 

Although  the  coding  differs  in  the  tapes  program,  the  same 

tests  are  performed  in  CKOUT  of  the  tapes  program. 

The  Gauss-Seidel  iterative  method,  discussed  in 
subsection  4.3   of  Chapter  III,  is  used  in  GSSOL  to  solve 
the  balance  equations.   Each  component  of  the  starting 
vector  is  equal  to  1/NSTATS,  where  NSTATE  is  the  number  of 
states  (and  balance  equations) .   The  method  is  considered  to 
have  converged  if 


(n)  (n-1)  (n)  -6 

(4.1)  max  |X  -    X  I    /   X  <      0.5    x    10 

i  i  i 

1<i<NSTATE 


(n)  (n-1) 

where    X  and   X  are  the    i-th   components   of, 

i        i 

respectively,  the  n-th  and  the  (n-1) -st  generated  vector 

(n)    n     (n)  . 
The  solution  method  guarantees  that  X    >  0.   X     is 

i  i 

assumed  to  be  zero  if  it  is  calculated  to  be  less  than 
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-27  (n) 

0.5  x  10    .In  the  case  that  X    is  zero,  the  fraction  in 

i 

(n-1) 
(4.1)  is  set  equal  to  zero  if  X      is  zero  or  one  if  not. 

i 

If  convergence  has  not  been  attained  in  five  hundred 

iterations  of  the  Gauss-Seidel  method,  a  message  is  printed 

and  the  problem  is  aborted.   The  convergence  criterion  used 

here  is, ' adaittely ,  very  stringent.   However,  it  permits  the 

luxury  of  not  normalizing  until  convergence  has  been 

attained.   All  variables  and  calculations  are  single 

precision . 

The  (nonoptional)  output  for  all  three  programs 
consists  cf  the  input  parameters,  the  total  number  of 
states,  the  cumber  cf  nonzero  terms  in  the  balance 
eguations,  the  number  of  iterations  for  convergence  of  the 
Gauss-Seidel  method,  and  for  each  processor:   the  idle 
probability,  the  probability  that  it  is  busy  with  each  type 
of  job,  the  average  occupancy,  and  the  average  length  of 
waiting  line.   Optional  output  includes  the  state  vectors, 
the  balance  equations  and  up  to  one  thousand  steady-state 
probabilities. 

Having  considered  the  similarities  in  the  three 
programs,  a  discussion  of  the  differences  is  in  order. 


1.3.   Differences  in  the  Programs 

For  the  tapes  program  the  number,  MP,  of  CPU's  (i. 
e.,  the  number  of  servers  at  the  CPU)  is  an  input  parameter 
This  number  is  set  equal  to  one  if  a  nonpositive  number  is 
provided  through  the  read  statement.   Note  that  the  CPU  can 
be  made  an  itfinite  server  queue  by  setting  the  number  of 
servers  equal  to  any  number  greater  than  or  equal  to  N1  + 
N2. 


211 


The  sequencing  of  the  states  is  according  to  a 
lexicographic  crdering  of  the  ISTATEml  vector 
representations  for  the  FCFS  and  PS  models  and  according  to 
a  lexicographic  ordering  of  a  slightly  modified  version  of 
the  ISTATlm2  vector  representations  for  the  tapes  model. 
The  modification  used  in  the  tapes  model  would  list  the  jobs 
at  PP1  (necessarily  all  twos)  at  the  right  end  rather  than 
the  left. end  of  the  left  subvector. 

In  addition  to  the  nonoptional  output  indicated  in 
1.2,  the  rapes  program  prints  the  average  number  of  each 
type  jot  at  each  processor,  the  average  number  of  CPU's  busy 
with  each  type  of  job,  the  average  number  of  CPU's  which  are 
idle,  the  distributions  from  which  many  of  these  averages 
are  calculated  (e.g.,  the  distribution  of  the  number  of  idle 
CPU's),  and  throughput  for  each  processor.   The  throughput 
of  a  processor  is  the  average  number  of  jobs  completing 
service  at  that  processor  per  unit  time.   It  may  be 
calculated  fcr  a  single  server  FCFS  processor,  for  example, 
by  multiplying  the  processing  rate  for  each  type  job  by  the 
prcporticn  of  time  the  server  is  busy  with  that  type  of  job 
and  accumulating  over  all  job  types. 


2.   NUMEBICAL  BESULTS 


Numerical  results  from  utilization  of  the  three  programs 
discussed  in  the  first  section  of  this  chapter  are  given  in 
this  section.   The  results  given  in  the  first  subsection 
shew  how  the  three  models  can  to  ways  in  which  soma  of  these 
models  might  be  approximated  by  models  be  compared  for  a 
particular  example.   The  second  subsection  is  devoted 
satisfying  the  conditions  of  local  balance.   The  chapter  is 
concluded  with  some  indications  of  running  times  and  core 
requirements  for  relatively  large  jobs. 
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2.1-      Use   of   the   Programs   to   Examine    Changes   in    CPU 
Configuration 

Baskett,    Chandy,    Muntz    and   Palacios   £6]  considered 

the   network    depicted    in   Figure   4.1.      The   queuing   discipline 

at   the  CPU    is   PS    and   each    peripheral    is  a    single   server   FCFS 

queue.      The    rates    for   the   exponential    service   distributions 

are    given   below   the    box   representing   the   corresponding 

processor    (for   example,    r        is   the   service   rate   of    type-one 

15 

jobs  at  the  CPU)  .   Only  the  nonzero  branching  probabilities 

are  given  in  Figure  4.1.   Since  each  peripheral  sees   only 
cne  type  cf  job,  the  system  satisfies  the  conditions  of 
local  balance.   Thus,  Baskett,  et.  al.  were  able  to  use  a 
prcduct-fcrm  solution  to  determine  the  utilization  of  each 
processor  as  the  number  of  type-one  jobs  increased  and  the 
number  of  type-two  jobs  remained  fixed  at  one.   The  idleness 
probabilities  (one  minus  the  utilization)  for  the  cases 
reported  by  Baskett,  et.  al.   are  given  in  the  first  column 
of  Table  4.1. 

Note  that  this  example  is  one  which  the  PS  model  is 
designed  to  handle .   The  PS  program  was  used  to  verify  the 
results  reported  in  [ 6  ]  and  to  provide  the  average 
occupancies  and  throughputs  as  reported  (for  the  same 
example)  in  the  first  column  of  Tables  4.2   and  4.3 
respectively. 

The  FCFS  program  was  used  to  give  the  second  column 
in  these  three  tables.   A  comparison  of  the  first  two 
columns  of  each  table  reveals  the  effect  of  changing  the  PS 
CPU  to  a  FCFS  CPU.   Note  that  the  differences  become  more 
pronounced  as  the  number  of  jobs  increases. 
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Figure  4.1- -Network  of  Baskett,  et.  al.  Example 
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TABLE  4.1 
IDLENESS  PROBABILITY  FOR  BASKETT,  et.  al.  MODEL 


CPU  gueulng 

■ 

discipline 

PS 

FCPS 

FCFS 

FCFS 

FCFS 

FCFS 

L 5  t 

JPCFS. 
6 

FCFS 
I 7__ 

FCFS 

[ 8_. 

_ jumber_CPl 

I'S 

1__ 

1 

2 

3__ 

4__ 

Processor 

N1 

.322 

.  320 

.634 

.765 

.317 

.854 

.878 

.395 

.909 

CPU 

1 

CPU 

2 

.230 

.  271 

.  598 

.731 

.799 

.839 

.866 

.  385 

.899 

CPU 

3 

.256 

.238 

.577 

.717 

.768 

.830 

.859 

.  879 

.894 

CPU 

4 

.241 

.214 

.564 

.709 

.782 

.825 

.854 

.  875 

.89  1 

CPU 

c 

.231 

.  197 

.556 

.7  04 

.773 

.822 

.857 

.873 

.889 

CPU 

6 

.225 

.  134 

.551 

.700 

.775 

.820 

.850 

.872 

.888 

CPU 

7 

.221 

.  175 

.54  8 

.698 

.774, 

.819 

.849 

.871 

.887 

PP1 

1 

.629 

.6  05 

.600 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

2 

.648 

.610 

.600 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

"m 

.661 

.615 

.600 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

4 

.670 

.620 

.601 

.600 

.600 

.600 

.600 

.600 

.  600 

PP1 

5 

.  676 

.6  24 

.601 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

6 

.679 

.628 

.601 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

7 

.682 

.631 

.601 

.600 

.600 

.600 

.600 

.600 

.600 

PP2 

1 

.616 

.726 

.536 

.5  86 

.586 

.586 

.586 

.  586 

.586 

PP2 

2 

.354 

.546 

.  357 

.352 

.352 

.352 

.352 

.352 

.352 

PP2 

i 

.257 

.417 

.224 

.218 

.218 

.218 

.218 

-  2  18  j 

.218 

PP2 

4 

.  169 

.320 

.  143 

.  138 

.138 

.  138 

.133 

.  138 

.  138 

PP2 

c 

.  1  12 

.246* 

.093 

.089 

.089 

.089 

.089 

.  089 

.089 

PP2 

6 

.074 

.  188 

.061 

.058 

.058 

.058 

.053 

.  053 

.053 

PP2 

7 

.049 

.  143 

.040 

.038 

.038 

.038 

.038 

.038- 

.038 

PP3 

1 

.744 

.317 

.724 

.724 

.724 

.724 

.724 

.724 

.724 

PP3 

2 

.596 

.697 

.571 

.568 

.563 

I. 568 

.563 

.568 

.563 

PP3 

.505 

.611 

.483 

.479 

.  4  7  9  J  .  4  7  9 

.479 

.479 

.479 

PP3 

a 

.446 

.547 

.429 

.426 

.425 

.425 

'*&$ 

.  425 

.425 

PP3 

c; 

.408 

.497 

.395 

.393 

.393 

.393 

.393 

.  393 

.393 

PP3 

6 

.333 

.4  59 

.374 

.372 

.372 

.372 

.372 

.  372 

.372 

PP3 

7 

.366 

.429 

.360 

.359 

.359| .359 
j 

.359 

.359 

.359 

PP4 

1 

.335 

.  883 

.322 

.822 

.822j .822 

.822 

.822 

.322 

PP4 

2 

.740 

.805 

,724 

.722 

.7221 .722 

.722 

.722 

.722 

PP4 

.682 

.750 

.667 

.665 

.665  .665 

.665 

.  665 

.  665 

PP4 

4 

.644 

.709 

.  633 

.631 

.631 j. 631 

.631 

.  631 

.  63  1 

PP4 

5 

.6  19| .677 

.611 

.610 

.610|.610 

.610 

.610 

.610 

PP4 

6 

.6031 .652 

.597 

.596 

.5961.596 

|  .596 

.  596 

.  596 

PP4 

L_2_- 

.^593. 

Li633. 

.588 

.^588. 

L^.588 

U588. 

L.588 

.  583 

.  588 
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TABLE    4.2 
AVERAGE    OCCUPANCIES    FOR    EASKETT,    et .    al.    MODEL 


CPU  queuing 
4  _  dis  c  ijb  lie  e 

Number  CPU's 

PS 

1_. 

l£CFS_! 

j_ 

* 

JLQ.ZS_ 

.FCFS. 

.FCFS. 

.FCFS. 

.FCFS. 

FCFS 

FCFS 

i 2_. 

3_. 

4_ 

_5_. 

6 

7__ 

8 

Processor. 

NJ_ 

CPU 

1 

.824 

1.03 

.732 

.732 

.732 

.732 

.732 

.732 

.732 

CPU 

2 

.999 

1.41 

.819 

.806 

.806 

.806 

.806 

.  806 

.  806 

CPU 

3 

1.  13 

1.74 

.377 

.849 

.848 

.848 

.848 

.848 

.848 

CPU 

4 

1.23 

2.04 

.916 

.876 

.374 

.874 

.874 

.  874 

.  874 

CPU 

5 

1.30 

2.30 

.  941 

.8  92 

.889 

.889 

.889 

.389 

.889 

CPU 

6 

1.36 

2.52 

.959 

.903 

.399 

.899 

.899 

.899 

.899 

CPU 

7 

1.39 

2.72 

.971 

.909 

.906 

.905 

.905 

.  905 

.905 

PP1 

1 

.37  1 

.395 

.400 

.400 

.400 

.400 

.400 

.400 

.400 

PP1 

2 

.352 

.390 

.400 

.400 

.400 

.400 

.400 

.400 

.400 

PP1 

3 

.339 

.385 

.400 

.400 

.400 

.400 

.400 

.  400 

.400 

PP1 

4 

.330 

.3  80 

.399 

.400 

.400 

.400 

.400 

.400, 

.400 

PP1 

5 

.324 

.376 

.399 

.400 

.400 

.400 

.400 

.400 

.400 

PP1 

6 

.321 

.3  72 

.399 

.400 

.400 

.400 

.400 

.400 

.400 

PP1 

7 

.318 

.369 

.399 

.400 

.400 

.400 

.400 

.400 

.400 

PP2 

1 

.384, 

.274 

.414 

.414 

.414 

.414 

.414 

.  414 

.414 

PP2 

2 

.839 

.  o02 

.910 

.916 

.916 

.916 

.916 

.916 

.916 

PP2 

3 

1.37 

.980 

1.43 

1.50 

1.50 

1.50 

1.50 

1  .50 

1.50 

PP2 

4 

1.97 

1.41 

2.13 

2.15 

2.  15 

2.  15 

2.  15 

2.  15 

2.  15 

PP2 

5 

2.63 

1.  89 

2.83 

2.  87 

2.87 

2.  87 

2.87 

2.37 

2.87 

PP2 

6 

3.36 

2.42 

3.60 

3.65 

3.65 

3.65 

3.65 

3.65 

3.65 

PP2 

7 

4.15 

3.  00 

4.42 

4.47 

4.47 

4.47 

4.47 

4.47 

4.47 

PP3    i 

1 

.  256 

.  183 

.  27o 

.276 

.276 

.  276 

.276 

.276 

.276 

PP3 

4 

.508 

.  373 

.547 

.551 

.551 

.551 

.551 

.551 

.551 

PP3 

3 

.747 

.567 

.301 

.308 

.809 

.809 

.809 

.  809 

.809 

PP3 

4 

.968 

.758 

1.03 

1  .04 

1  .04 

1.04j 

1.04 

1  .04 

1.04 

PP3 

5 

1.16 

.9  44 

1.  23 

1.24 

1.24 

1.24 

1.24 

1.24 

1.24 

PP3 

6 

1.34 

1.  12 

I  1.39 

1.41 

1.41 

1.41 

1.41 

1  .41 

1.41 

PP3 

7 

1.48 

1  .29 

|1.53 

1.  54 

1.54 

1.54 

1.54 

1  .54 

1.54 

PP4 

1 

.  165 

.  1  17 

.  178 

.178 

.  178 

.178 

.  178 

.  178 

.  178 

PP4 

2 

.30  3 

.  226 

|.325 

.327 

.327 

.327 

.327 

.  327 

.  327 

PP4 

3 

.415 

.325 

.441 

.445 

.445 

.445 

.445 

.  445 

.445 

PP4 

4 

.504 

.414 

|  .529 

.533 

.534 

.534 

.534 

.  534 

.  534 

PP4 

5 

.57  2 

.493 

.595 

.599 

.599 

.599 

.599 

.  599 

.599 

j    PP4 

6 

|  .624 
[.661 

.562 

.642 

.645 

.646 

.646 

.646 

.646 

.646 

L2_. 

.621 

.2.9.1^ 

.678 

.679 

.679J 

.679, 

.679j 

.679 
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TABLE    4.3 
IHfiOUGHPUT    FOR    BASKETT,    et.    al.    MODEL 


CPU    queuing 

'— "" '                " 

I 1 

-j 

' 

"   '              """■ 

i 

• 

-___. 

1     "     '  "    ■ 

j __  discipline 

_PS_. 

FCFS 

FCFS 

.FCFS. 

.FCFS. 

.FCFS.rFCFS. 

.FCFS. 

FCFS.. 

Number    CPU's 

1 

1    ^ 

"    2 

3 

4 

5 

6 

7 

8 

__P  10  cesser. 

N1 

CPU 

1 

1.65 

1.  38 

1  .78 

1.78| 

1  .78 

1  .78 

1.78 

1  .78 

1.73 

CPU 

2 

2.26 

1.88 

2.44 

2.45 

2.45 

2.45 

2.45 

2.45 

2.45 

CPU 

3 

2.63 

2.  24 

2.82 

2.83 

2.  83 

2.  83 

2.33 

2.83 

2.83 

CPU 

4 

2.87 

2.  51 

3.05 

3.  06 

3.06 

3.06 

3.06 

3.06 

3.06 

CPU 

c 

3.02 

2.72 

3.19 

3.20 

3.20 

3.20 

3.20 

3.20 

3.20 

CPU 

6 

3.  13 

2.  88, 

3.28 

3.29 

3.29 

3.29 

3.29 

3.29 

3.29 

CPU 

7 

3.19 

3.00 

3.34 

3.35 

3.35 

3.35 

3.35 

3.35 

3.35 

PP1 

1    ' 

.556 

.593 

.600 

.600 

.600 

.600 

.600 

.600 

.600 

PP1          | 

2 

.527 

.585 

.600 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

■4 

.506 

.577 

.599 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

4 

.495 

.570 

.599 

.600 

.  600 

.600 

.600 

.  600 

.600 

PP1 

c 

.487 

.5  64 

.599 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

6 

.  48  1 

.558 

.599 

.600 

.600 

.600 

.600 

.600 

.600 

PP1 

7 

.477 

.553 

.599 

.600 

.600 

.600 

.600 

.600 

.600 

PP2 

1 

.  384 

.2  74 

.4  14 

.414 

.414 

.414 

.414 

.  414 

.414 

PP2 

2 

.606 

.454 

.643 

.647 

.647 

.647 

.647 

.  647 

.647 

PP2 

.743 

.5  83 

.776 

.782 

.782 

.782 

.732 

.782 

.782 

PP2 

4 

.831 

.680 

.857 

.861 

.862 

.  862 

.862 

.  86  2 

.  862 

PP2 

5 

.888 

.7  54 

.907 

.911 

.91  1 

.  91  1 

.911 

.  91  1 

.911 

PP2 

6 

.926 

.812 

.939 

.942 

.942 

.942 

.942 

.  942 

.942 

PP2 

7 

.951 

.857 

.960 

.962 

.962 

.962 

.962 

.  96  2 

.96  2 

PP3 

1 

.384 

.274 

.414 

.414 

.414 

.414 

.414 

.  414 

.  414 

PP3 

2 

.  606 

.  454 

.643 

.648 

.648 

.648 

.648 

.648 

.643 

PP3 

~ 

.743 

.5  83 

.776 

.782 

.782 

.782 

.782 

.782 

.782 

PP3 

4 

.831 

.680 

.357 

.862 

.862 

.  862 

.862 

.  862 

.  862 

PP3 

5 

.888 

.7  54 

.907 

.911 

.91  1 

.91  1 

.911 

.  91  1 

.911 

PP3 

6 

.926 

.812 

.  939 

.942 

.942 

.942 

.942 

.  942 

.942 

PP3 

7 

.951 

.857 

.960 

.962 

.962 

.962 

.962 

.  96  2 

.96  2 

PP4 

1 

.  32  9 

.235 

.355 

.355 

.35  5 

.355 

.355 

.355 

.355 

PP4 

2 

.520i 

.389 

.5i>1 

.555 

.555 

.555 

.555 

.  555 

.555 

PP4 

.637 

.500] 

.665 

.670 

.670 

.670 

.670 

.670 

.670 

PP4 

4 

.712 

.583 

.734 

.739 

.739 

.739 

.739 

.739 

.739 

PP4 

5 

.761 

.646 

.778 

.781 

.781 

.781] 

.78  1 

.781 

.781 

PP4 

6 

.793 

.696 

.805 

.  807 

.308] 

.808 

.303 

.808 

.803 

PP4 

7 

.81  5j 

.734 

.823j 

.825 

.  32  5j 

.825 

.825 

.  825 

.825 
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Since  only  type-two  jobs  are  allowed  to  route  to 
PP1,  and  since  only  one  type-two  job  is  considered,  PP 1  is 
an  IS  queue  and  the  tapes  program  can  be  used  to  determine 
the  effect  cf  increasing  the  number  of  (FCFS)  CPU's.   This 
was  done  and  the  results  are  also  reported  in  the  three 
tables. 

Consider  in  more  detail,  now,  the  effect  of 
replacing  a  £S   CPU  with  a  FCFS  CPU  in  a  model  such  as  that 
depicted  in  Figure  4.1.   Since  the  operations  are  exactly 
the  same  in  the  two  models  except  when  the  system  is  in  a 
state  in  which  two  or  more  jobs  occupy  the  CPU,  only  these 
states  will  be  considered.   In  the  PS  CPU  each  job  receives 
service  immediately  upcn  arrival.   However,  the  rate  of  this 
service  is  degraded  according  to  the  number  of  jobs  which 
occupy  the  CPU.   In  the  FCFS  CPU  there  is  no  degradation  of 
service.   This  fact  supports  a  tendency  toward  shorter 
occupancy  times  (waiting  time  plus  service  time)  for 
individual  jobs  and,  therefore,  greater  throughput  for  the 
JCFS  CPU  than  for  the  PS  CPU.   However,  this  tendency  is 
counterbalanced  by  the  fact  that  jobs  must  wait  in  turn  for 
service  in  the  FCFS  CPU,  but  receive  service  immediately  in 
the  PS  CPU.   This  latter  effect  becomes  dominant  if  the 
service  rates  for  one  type  of  job  is  larger  at  the  CPU  and 
the  PP's  than  that  for  the  other  type  of  job  is  at  the  CPU. 

Such  a  case  is  exemplified  in  the  model  by  Baskett, 
et.  al.   [6]  discussed  above  and  depicted  in  Figure  4.1.   In 
this  example  the  service  rates  for  the  type-one  jobs  are 
larger  at  most  PP's  and  much  larger  at  the  CPU  than  the 
service  rate  for  type-two  jobs  at  the  CPU.   In  the  PS  model 
a  type-one  job  arriving  at  the  CPU  after  a  type-two  job 
might  complete  service  at  the  CPU,  receive  service  at  cne  of 
the  PP's  and  return  to  the  CPU  several  times  before  the 
type-two  job  leaves  the  CPU.   In  the  FCFS  model  the  type-one 
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jot  would  be  forced  to  wait  for  service  until  the  type-two 
jot  left  the  CPU.   As  emphasized  in  the  preceding  paragraph, 
the  effects  of  degraded  service  in  the  PS  model  are 
counterbalanced  by  the  effects  of  forced  waiting  in  the  FCFS 
model.   Hcwever,  as  can  be  seen  in  Table  4.3,   the  net 
result  in  the  cases  for  which  data  are  given  is  a  decrease 
in  throughput  at  the  CPU  when  the  PS  CPU  is  replaced  by  a 
FCFS  CPU.'   This  results  from  a  decrease  in  the  throughput 
for  type-cne  jobs  and  an  increase  for  type-two  jobs,  as  can 
be  seen  by  examining  the  throughput  at  the  PP's.   From 
Tables  4.1   and  4.2   it  is  also  seen  that  idleness 
pre tabilities  decrease  at  the  CPU  and  PP1  and  increase  at 
PP2,  PP3  and  PP4,  and  average  occupancies  increase  at  the 
CPU  and  PP1  and  decrease  at  PP2,  PP3  and  PP4. 

The  difference  in  results  between  the  PS  model  and 
the  FCFS  model  increase  as  the  number,  N1,  of  type-cne  jobs 
increases,  while  the  number  of  type-two  jobs  remains 
cocstatn  at  one.   The  reason  for  this  is  that  the 
degradation  cf  service  in  the  PS  CPU  becomes  less  sericus 
for  the  type-one  jobs  and  more  serious  for  the  type-twc  job 
as  the  nujober  of  jobs  at  the  CPU  increases.   For  example, 
with  one  job  of  each  type  at  the  CPU,  the  rate  of  departure 
of  jobs  from  the  CPU  is  9/2  for  type-one  jobs  and  1/2  for 
type-two  jobs.   Increasing  the  number  of  type-one  jobs  to 
fcur  changes  these  rates  to  36/5  and  1/5. 

Finally,  consider  the  effect  of  increasing  the 
number  of  (FCFS)  servers  at  the  CPU.   Note  that  in  those 
cases  in  which  the  number  of  type-one  joos  is  smaller  than 
the  number  of  CPU's,  the  CPU  has  an  IS  queuing  discipline. 
Frcm  this  point  on,  increasing  the  number  of  CPU's  has  no 
effect  on  the  operation  of  the  network..   The  reason  that 
idleness  probabilites  in  Table  4.1  continue  to  change  past 
this  point  for  the  CPU  is  that  they  were  found  by  dividing 
the  average  number  of  idle  CPU's  by  the  total  number  of 
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CPU's.   If  each  idle  CPU  has  an  egual  chance  of  being  chosen 
by  a  newly  arriving  job,  this  idleness  probability  is  the 
proporticn  of  time  that  each  CPU  is  found  idle.   Since  these 
IS  CPU  cases  provide  immediate,  full  service  (as  opposed  to 
the  delayed  service  of  the  FCFS  CPU  and  the  degraded  service 
of  the  PS  CPU)  throughput  is  a  maximum  for  all  processors. 
Idleness  probability  is  small  and  average  occupancy  large 
(relative  to  all  other  cases  reported  for  the  same  job  mix) 
at  the  PE's. 

As  the  number  of  servers  at  the  CPU  increases  from 
one  to  N 1  +  1,  througnput  at  all  processors  increases.   The 
reason  for  this  is  obvious  at  the  CPU  and  nearly  so  at  the 
PP's:    More  servers  at  the  CPU  means  more  rapid  return  of 
jots  to  the  PP's,  resulting  in  a  greater  proportion  of  time 
spent  serving  jobs.   Similarly,  the  idleness  probabilities 
at  the  PP's  decrease  and  the  average  occupancies  increase. 
The  result  is  a  corresponding  decrease  in  average  occupancy 
and  a  resulting  increase  in  idleness  probability  at  the  CPU. 
Note  that  in  each  case  there  is  a  point  beyond  which 
increasing  the  number  of  CPU's  (that  is,  the  number  of 
servers  at  the  CPU)  does  not  significantly  improve  the 
system.   For  example,  with  N1  =  7  there  are  no  changes  in 
the  measures  of  effectiveness  (down  to  the  third  significant 
digit)  fcr  the  PP's,  and  little  change  in  those  for  the  CPU, 
as  the  number  of  servers  at  the  CPU  is  increased  beyond 
three.   The  reason  for  this  phenomena  is  that  the  PP's  have 
become  the  bottleneck  at  this  point.   Judging  from  the 
results  given  in  the  three  tables,  increasing  the  number  of 
CPU's  beycnd  two  would  probably  not  be  justified  if  the  job 
mixes  given  are  representative  of  those  which  would  be  seen 
by  the  computer  modelled. 
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2.2.   Approximation  by_  Local  Balance  Networks 

In  this  subsection  consideration  is  given  to  ways  in 
which  a  queuing  network  which  does  not  satisfy  the  local- 
balance  conditions  might  be  approximated  by  one  which  does. 
The  goal  .is  to  find  an  approximation  whose  steady-state 
properties  are  vary  nearly  the  same  as  those  of  the  original 
network.   If  such  an  approximation  can  be  found,  much  time 
and  effort  cculd  be  saved  by  taking  advantage  of  the 
product-fcrm  solution  to  the  approximate  model. 

Cf  particular  interest  is  the  FCFS  model  discussed 
in  secticn  1.   This  model  does  not  satisfy  the  local-balance 
conditions  because  it  contains  at  least  one  FCFS  queue  at 
which  jobs  cf  different  types  receive  service  at  different 
rates.   As  can  be  seen  from  subsection  5.3   of  Chapter  I, 
the  work  cf  Easkett,  Chandy,  Muntz  and  Palacios  [6] 
indicates  that  local  balance  would  prevail  if  the  queuing 
discipline  at  these  queues  were  PS,  IS  or  LCFSPH  rather  than 
FCPS,  or  if  the  processing  rates  were  the  same  for  the  two 
job  types  at  each  FCFS  processor.   These  observations  will 
form  the  basis  for  most  of  the  approximations  discussed 
here. 

Reviewing  the  example  of  Baskett,  et.  al.   expanded 
upon  in  subsection  2*1,  it  is  seen  that  replacing  the  FCFS 
CPU  by  either  an  IS  or  a  PS  CPU  could  lead  to  significant 
differences  in  the  measures  of  system  performance  (see 
Tables  4.1,  4.2  and  4.3).   The  reason  for  this  is  that  the 
rapidly-mcvirg  type-one  jobs  must  at  times  wait  at  the  FCFS 
CPU  for  the  slew-moving  type-two  jobs  to  complete  service. 
In  the  IS  CPU  there  is  no  interference  between  the  two  types 
of  jobs.   In  the  FS  CPU  there  is  interference  in  the  form  of 
degraded  service,  but  the  extreme  difference  in  service 
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rates  makes  this  interference  considerably  less  severe  than 
that  in  the  FCFS  CPU. 

If  the  FCFS  CPU  were  replaced  with  a  LCFSPH  CPU  in 
the  same  model,  seme  interference  of  the  type  encountered  in 
the  FCFS  CPU  would-  occur.   The  completion  of  service  of  any 
type-one  job  at  the  (LCFSPR)  CPU,  whether  the  type-one  job 
was  enqueued  or  in  service  when  the  type-two  job  arrived, 
would  necessarily  wait  until  the  type-two  job  had  completed 
its  service.   However,  the  high  service  rate  of  the  type-one 
jots  at  the  CPU  as  compared  to  their  rates  at  the  PP's  makes 
the  probability  that  the  type-two  job  will  catch  one  or  more 
type-one  jobs  at  the  CPU*  when  it  arrives  rather  small.   On 
the  other  hand,  the  chances  are  good  that  the  type-two  job 
will  be  interrupted  (perhaps,  several  times)  while  at  the 
CPD.   The  anticipated  result  is  an  increase  in  idleness 
probability  and  a  decrease  in  average  occupancy  and 
throughput  at  PP 1  and  the  reverse  of  this  at  PP2,  PP3  and 
PP4.   These  facts  are  easily  seen  to  be  true  since  a  careful 
reading  of  [ 6  ]  reveals  that  the  idleness  probabilities, 
average  occupancies  and  throughputs  for  this  example  are  the 
same  whether  the  CPU  has  a  PS  queuing  discipline  or  a  LCFSPR 
queuing  discipline.   That  is,  "PS"  in  Tables  4.1,  4.2      and 
4.3   could  be  replaced  by  "LCFSPR." 

So  it  is  seen  that  simple  replacement  of  a  FCFS 
processor  by, a  PS,  IS  or  LCFSPR  processor  with  the  same 
service  distributions  may  result  in  significantly  different 
values  for  the  measures  of  system  performance.   It  may  be 
possible  to  arrive  at  better  approximations  by  also  changing 
the  service  distributions  (for  example,  by  adjusting  the 
service  rates) .   However,  the  types  of  changes  to  make  are 
not  obvious  if  the  queuing  discipline  is  also  changed. 

In  [6]  Baskett,  et .  al.   investigated  an 
approximation  to  the  above  model  in  which  the  two  classes  of 
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jots  are  replaced  by  a  single  "equivalent"  class  of  jots. 
For  each  value  of  N1  they  retained  the  given  service  rates 
at  the  PP's  and  replaced  the  service  rates  at  the  CPU  ty 


,..2,      r5   =   ,9i  ♦  ,2,  /  ^/r^  ♦  ^/r^, 


and  the  branching  probabilities  by 


(4.3)     p   =  (qp   +  q  p  )  /  (q  +  q  ) 

j        1  1j     2  2]       1     2 


where  q      is  the  rate  at  which  class  i  jobs  leave  the  CPU. 
i 

These  rates  are  determined  from  the  solution  of  the  original 

model.   These  choices  for  the  rate  at  the  CPU  and  the 
branching  probabilities  for  the  "equivalent"  class  of  jobs 
are  reasonable  since  they  are  the  values  for  the  parameters 
which  wculd  actually  be  measured. 

Since  the  resulting  network  is  a  central-server 

model  with  a  single  job  type  and  single-server  exponential 

queues,  the  work  of  Gaver  [38]  applies  (see  subsection  4.2 

of  chapter  I) .   That  is,  the  joint  steady-state  probability 

of  finding  n   jobs  at  the  i-th  PP  (where  n   +  n   +  n   +  n 
i  12     3     4 

is  less  than  or  equal  to  the  total  number  of  jobs  in  the 

system)  is  proportional  to  the  product  over  i  of  R   raised 

i 

to  the  n   power  where  R   =  r  p  /r  .   With  the  choices  of  r 
i  i     5  i   i  5 

and  p   given  in  (4.2)  and  (4.3),  respectively,  R   is  the 
i  i 

ratio  of  the  utilization  of  the  i-th  PP  to  the  utilization 

of  the  CPU.   This  same  result  generalizes  to  the  less 
specific  ICYS   model  in  which  both  joa  types  are  allowed  to 
visit  each  processor. 
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The  fact  that  the  exact  results  from  each  case  (that 
is,  each  job  mix)  must  be  used  to  derive  the  parameters  for 
the  approximate  model  makes  this  method  of  dubious  value. 
The  exact  model  must  be  solved  (or  measurements  taken  on  the 
system  represented)  in  order  for  the  parameters  of  the 
approximate  model  to  be  determined.   If  the  queuing 
disciplices,  service  rates,  branching  probabilities  or  job 
mix  changes,  these  parameters  must  be  recalculated  from  new 
measurements.   They  cannot  be  determined  from  the  parameters 
of  the  exact  model  directly. 

Besides  this  Baskett,  et .  al.   showed  that 
significant  differences  can  exist  between  the  results  given 
by  the  exact  model  and  those  given  by  the  approximate  model. 
The  author  has  had  similar  experience  in  applying  this 
method  to  a  variety  of  two-PP  FCFS  models. 

An  area  which  is  still  open  for  investigation  is 
approximation  of,  for  example,  the  FCFS  model  by  a  model  in 
which  the  branching  probabilities  are  the  same  but  the 
service  rates  for  jobs  of  different  classes  are  replaced  by 
a  single  service  rate  at  each  FCFS  queue.   The  work  of 
Baskett,  et .  al.  in  [6]  could  then  be  used  to  determine  the 
measures  cf  system  performance  in  the  approximate  model. 
Perhaps,  with  the  aid  of  the  FCFS  program  an  appropriate 
choice  for  the  composite  service  rates  could  be  determined. 

2.3.   Specifications  for  Large  Jobs 

In  this  final  subsection  attention  is  focused  en  the 
running  time  and  core  requirements  of  some  large  jobs.   With 
common  values  for  system  parameters  (N1,  N2,  tne  service 
rates  and  the  branching  probabilities)  the  FCFS  model 
results  in  more  states  than  either  of  the  other  two  models. 
The  PS  queuing  discipline  at  the  CPU  in  the  PS  model  and  the 


224 


multiserver  at  the  CPU  and  infinite  server  at  PP1  in  the 
tapes  model  reduce  the  size  of  the  state  space  from  that 
experienced  in  the  FCFS  model.   In  addition,  despite  the 
fact  that  more  nonzero  entries  are  required  per  row  in  the 
rate  matrix  (COEF)  for  both  the  PS  model  and  the  tapes 
model,  the  reduced  state  spaces  in  these  models  result  in  a 
reduction  in  the  total  number  of  nonzeros  in  large  cases. 
As  a  result  the  primary  concern  here  is  the  requirements  of 
the  FCFS  program. 

A  FCPS  model  with  four  jobs  of  each  type  (N1  =  N2  = 
4)  and  two  PP's  (NPP  =  2)  resulted  in  3150  states  and  10080 
nonzeros  in  the  rate  matrix.   Dimensioning  the  state 
prcbability  array  (P)  and  the  three  arrays  used  for  stcrage 
of  the  balance  equations  (NCON,  INDEX  and  COEF)  accordingly, 
the  program  required  164K  bytes  of  storage  on  the  IBM  360-67 
computer  at  the  Naval  Eostgraduate  School  in  Monterey, 
California.   The  service  rates  used  in  this  model  were  0.6, 
1.2   and  0.9  for  type-one  jobs  and  0.8,  1.0  and  0.5  for 
type-two  jobs  at  the  CPU,  PP1  and  PP2,  respectively.   The 
branching  probabilities  were  0.55  and  0.45  for  type-one  jobs 
and  0.30  and  0.70   for  type-two  jobs.   Convergence  was 
attained  (using  the  rather  stringent  criterion  described  in 
section  1)  in  187  Gauss-Seidel  iterations.   The  total 
execution  time  for  the  program  was  212.13  seconds. 

A  second  case  using  the  FCFS  model  with  N1  =  2,  N2  = 
5  and  NPP  =  4  resulted  in  6930  states  and  35280  nonzeros  in 
the  rate  matrix.   Dimensioning  P,  NCON,  INDEX  and  COEF 
accordingly,  the  problem  required  396K  bytes  of  storage. 
The  service  rates  used  were  10.0  for  type-one  jobs  and  1.0 
for  type-two  jobs  at  the  CPU  and  the  reverse  of  these  at 
each  peripheral  processor.   Branching  probabilities  were  all 
.25  regardless  of  job  type  or  PP.   Convergence  was  attained 
in  206  iterations.   The  program  required  527.05  seconds  of 
execution  time. 
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The  PS  model  was  also  run  using  the  parameters  of 
this  second  example.   The  result  was  5754  states  and  30464 
nonzeros.   The  dimensions  of  the  FCFS  model  (6930  and  35280) 
were  used  in  this  model  and,  because  of  differences  in  the 
programs  themselves,  this  resulted  in  a  requirement  for  398K 
bytes  of  storage.   This  figure  could  have  been  reduced  by 
dimensioning  P,  NCON,  INDEX  and  COEF  according  to  actual 
requirements.   Convergence  was  attained  in  just  32 
iterations.   The  program  required  98.63  seconds  of  execution 
time.   The  measures  of  system  performance  reported  in  Table 
4.4  for  these  two  models  again  point  out  the  extent  of 
possible  error  when  a  FS  C?0  is  assumed  to  be  a  FCFS  CPU,  or 
visa  versa.   In  this  particular  comparison  type-one  jots  are 
"PP  bound"  and  type-two  jobs  are  "CPU  bound." 

Although  the  number  of  iterations  required  for 
convergence  of  the  Gauss-Seidel  technique  for  solution  of 
the  balance  equations  generally  increases  with  the  numfcer  of 
states,  wide  variation  exists  in  the  number  of  iterations 
required  for  a  problem  with  a  given  number  of  states. 
Indeed,  the  author  has  examples  of  cases  with  only  30  states 
reguiring  more  iterations  than  the  5754-state  case  reported 
in  the  preceding  paragraph.   This  should  not  be  too 
surprising.   As  pointed  out  in  Chapter  III,  the  number  of 
iterations  required  for  convergence  depends  upon  the 
magnitude  of  the  nondominant  eigenvalues  of  the  rate  matrix. 
The  eigenvalues  of  any  matrix  depends  upon  each  and  every 
entry  in  the  matrix.   Changing  a  single  value  in  the  matrix 
will,  in  general,  change  all  of  the  eigenvalues.   Thus, 
little  hope  is  seen  for  being  able  to  predict  the  number  of 
iterations  required,  or  the  running  time,  based  upon  the 
system  parameters.   The  running  time  can  be  reduced  by 
relaxing  the  convergence  criterion.   Since  the  real  items  of 
interest  are  the  measures  of  system  performance,  such  as 
idleness  probabilities  and  average  occupancies,  changes  in 
these  quantities  could  be  used  as  a  basis  for  determining 
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TABLE  4.4 

COMPARISON  OF  FCFS  AND  PS  PROGRAMS  IN  A 
CASE  IN  WHICH  NPP  =4,  N1  =  2  AND  N2  =  5 

FCFS  i  Program    PS  Program 

Number  of  states  6930  5754 

Number  cf  nonzeros 
in  rate  matrix 

CPU  idleness  probability 

PP  idleness  probability 

CPU  average  occupancy 

PP  average  occupancy 

CPU  throughput 

PP  throughput 

Note  that  performance  at  the  four  PP's  is  the  same.   The 
figures  reported  here  are  for  each  PP. 


35280 

30464 

000197 

.000920 

.890 

.696 

6.438 

5.273 

.  141 

.432 

1.308 

2.014 

.327 

.504 
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convergence.   Unfortunately,  any  savings  which  might  be 
realized  by  decreasing  the  number  of  iterations  in  this  way 
would  be  diminished  by  the  time  required  to  calculate  these 
measures  at  each  iteration.   In  many  cases  the  result  may  be 
a  net  increase  in  running  time. 

Ccmparisons  of  the  cere  requirements  and  running 
time  with  these  required  by  other  possible  solution  methods 
are  difficult  at  best.   None  of  the  examples  presented  here 
were  run  en  a  dedicated  machine.   Because  of  the  effects  of 
other  jots  in  the  system,  rerunning  these  same  jobs  would 
likely  result  in  different  execution  times.   Although  the 
author  was  "core  conscious"  in  programming  the  models,  there 
are  probably  still  ways  in  which  core  usage  could  be 
reduced. 

In  general,  the  following  statement  can  be  made.   If 
a  product-form  solution  exists,  running  time  is  bound  to  be 
shcrter  fcr  a  program  designed  to  take  advantage  of  it.   The 
reason  is  that  no  iterations  are  required,  and  it  is  net 
necessary  to  generate  and  store  the  balance  equations.   Even 
in  such  cases,  however,  some  of  the  concepts  and  techniques 
developed  in  Chapter  II  may  be  helpful  in  generating  all 
states  without  duplications. 
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V.   IHE  SOFTWARE-MONITOR  PROBLEM 


Software  monitors,  or  system  jobs,  are  used  in  most 
large-scale,  multiprogrammed  computer  systems  to  provide 
data  concerning  performance  of  the  system  and  usage  of  the 
system  by  individual  user  jobs.   In  this  chapter  some 
possible  sources  of  bias  in  the  data  provided  by  such 
monitors  are  investigated.   Modelling  the  computer  system  as 
a  closed  central-server  system,  two  types  of  software 
monitors  are  examined.   In  each  case  a  method  is  presented 
which  will  predict  the  bias  in  the  monitor. 

Consider  a  situation  in  which  a  central-server  model  is 
developed  for  an  existing  computer  system,  and  a  software 
monitor  is  used  to  collect  syst €m-perf ormance  data  in  order 
to  validate  the  model.   Ihe  results  of  this  chapter  could  be 
used  to  explain  differences  between  model  results  and 
monitor  results.   The  question  of  correcting  monitored  data 
for  bias  is  not  addressed  here.   This  remains  an  open 
guestion  which  has  apparently  not  been  considered  in  the 
open  literature. 


1.   THE  BEOELIM 


One  of  the  major  problems  in  modelling  real  systems  is 
monitoring  the  system  so  as  to  collect  the  data  necessary  to 
estimate  the  parameters  used  in  the  model.   The  same  problem 
arises  in  validation  of  resulting  models  where  system 
performance  must  be  monitored  for  comparison  with  model 
predictions. 
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For  computer  systems  two  types  of  monitors  are  generally 
available.   Hardware  monitors  are  external  devices  which, 
when  attached  to  various  processors,  determine  the  activity 
at  the  processors  by  measuring  passage  of  electrical  current 
and  changes  in  electrical  potential.   Software  monitors  are 
internal  devices,  or  system  programs,  which  use  computer 
processing  capabilities  (CPU  time  and  memory)  to  collect 
data.   Individual  program  accounting  data  (used  as  a  basis 
for  charging  system  users)  are  one  class  of  data  collected 
using  software  monitors. 

Software  monitors,  in  using  computer  resources,  somewhat 
degrade  the  resources  available  to  users  of  the  system.   For 
example,  system  programs  require  storage  (often  in  core)  , 
degrading  the  amount  of  storage  available  for  very  large 
user  programs.   They  also  require  processing  time  and 
therefore  increase  "turnaround  time"  for  users  programs. 
These  are  usually  not  serious  problems,  and  software 
monitors  are  generally  much  less  expensive  and  more  flexible 
than  hardware  mcnitors.   This,  plus  the  fact  that  it  is 
extremely  difficult  to  differentiate  between  individual 
programs  with  a  hardware  monitor,  makes  the  software 
monitors  the  most  likely  choice  for  data  collection,  whether 
for  accounting  data,  model-parameter  data  or  model-* 
validation  data. 

In  this  chapter  models  of  two  types  of  software  monitors 
are  discussed.   A  possible  source  of  bias  in  the  data 
collected  by  such  monitors  is  analyzed.   Throughout  the 
chapter  the  computer  system  under  consideration  is  modelled 
as  a  closed  central-server  network  of  queues  as  pictured  in 
Figure  1.1.   This  model  is  assumed  to  be  an  accurate 
representation  of  the  system,  and  steady-state  probabilities 
(assumed  to  exist)  calculated  from  the  model  are  therefore 
also  assumed  to  be  accurate.   The  purpose  of  the  software 
monitor  in  each  case  is  to  determine  system  performance  by 
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observing  this  probability  distribution.   A  model  of  the 
monitor  is  used  to  determine  the  distribution  "seen"  by  the 
monitor  sc  that  comparison  may  be  made  with  the  "true" 
distribution  and  bias  determined. 

In  section  3  consideration  is  given  to  a  monitor  which 
briefly  occupies  the  CPU  and  determines  the  state  of  the 
system  each  time  a  job  leaves  the  CPU.   In  section  4  the 
monitor  enters  the  system  infrequently,  waits  for  a  free 
server  at  the  CPU,  then  briefly  occupies  the  CPU  and 
determines  the  state  of  the  system.   In  both  sections  the 
problem  is  analyzed  for  a  Jackson-type  network  (as  described 
in  section  2)  and  then  extended  to  more  general  cases  such 
as  those  discussed  in  Chapter  II. 


2.   THE  EASIC  MODEL 


The  scft ware-monitor  models  discussed  in  sections  3  and 
4  will  be  analyzed  extensively  in  terms  of  the  closed 
central-server  model  depicted  in  Figure  1.1  with  the 
fcllowing  assumptions: 

(i)  Each  queue  is  a  single-server,  exponential 

queue  with  constant  service  rate  and  infinite 

capacity. 

(ii)  Only  one  type  of  job  circulates  in  the 

system. 

(iii)  Branching  probabilities  are  constant. 
Under  these  assumptions  the  queuing  network  under 
consideration  is  called  a  Jackson  network  (the  term  is 
borrowed  from  Melamed,  Zeigler  and  Beutler  [80])  in  honor  of 
J.  E.  Jackson  [56]  who  first  exhibited  a  product-form 
solution  for  the  steady-state  probability  distribution  of 
such  models.   Before  examining  this  solution,  consider  the 
balance  eguations  which  tne  solution  must  satisfy. 
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The  model  described  here  is  a  continuous-* time, 
finite-state  Markov  process  which  is  also  irreducible. 
Hence,  the  steady-state  probability  distribution  exists  and 
can  be  determined  by  solving  a  set  of  balance  equations 
together  with  a  normality  condition.   (See  Feller  [32].) 
Derivation  of  the  balance  equations  through  limiting 
arguments  and  the  Komolgorov  differential  equations  is  a 
standard  exercise  in  such  cases  and  is  therefore  omitted. 
(See  section  3  of  Chapter  I.)   Direct  development  of  the 
fcalance  equations  will  follow  the  introduction  of  some 
notation . 

Let  N  denote  the  total  number  of  jobs  circulating  in  the 

system.   The  M  +  1  processors  are  numbered  sc  that  processor 

i  is  PPi  for  i  =  1,  2,  ...,  M  and  processor  H  +  1  is  the 

CPU.   The  service  rate  at  processor  i  is  r   and  the 

i 

branching  prctability  (from  the  CPU)  to  PPi  is  b  .   By  the 

i 

Markov-like  assumptions,  a  state  A  is  completely  specified 

by  a  listing  of  the  number  of  jobs  at  each  processor.   Thus, 

A  =  (n  ,n  ,...,n    )  where  for  each  i  =  1,  2,    ...,  11+1,  n 
1   2       M+1  i 

is  the  number  of  jobs  at  processor  i  when  the  system  is  in 

state  A.   Note  that  n   +  n   +  ...  +  n     =N.   Also  note 

1     2  M  +  1 

that  every  vector  (n  ,n  ,...,n    )  of  nonnegative  integers 

12       M+1 

which  sum  to  N  represents  a  valid  state. 


For  any  state  A  =  (n  , n  ,...,n    ),  let  A   denote  the 

12       M+1         ij 

state  with  i-th  component  increased  by  one  and  j-th 

component  decreased  by  one;  e.  g.,   A   = 

ij 

(n  ,...,n  +1, ,n  -1,...,n    )   if  i  <  j .   Since  A    is  not 

1       i        j        M+1  ij 
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a  valid  state  if  n   =0,  any  statement  containing,  for 

J 

example,  the  probability  of  A    will  have  that  probability 

ij 

multiplied  by  0(n  )  where 

J 


1    if  n  >  0 
(5.1)  *(n) 

0    if  n  <  0 


so  that  a  large  variety  of  special  cases  need  not  be 
considered  to  handle  such  "non-states. " 

For  each  state  A,  P  (A)  will  denote  the  steady-state 
probability  that  the  system  is  in  state  A. 


Let  A  =  (n  ,n  ,...,n    )  be  any  state.   To  develop  the 
1   2       M+1 

balance  equation  associated  with  state  A,  note  that: 

(i)  The  rate  of  transition  out  of  state  A  (given 
the  system  is  in  state  A)  is  given  by 


M+1 
(5.2)  S  (A)  =  l  j6(n  ,)r  . 

(ii)  The  rate  of  transition  from  state  A      into 

j/M+1 

state  A  (given  n    >  0  and  the  system  is  in  state 

M  +  1 

A     )  is  r  . 
j,M+1       j 

(iii)  The  rate  of  transition  from  state  A 

M  +  1,j 

into  state  A  (given  n   >  0  and  the  system  is  in 

j 

state  A     )  is  t  r 

M+1,j       j  M+1 

(iv)  The  states  A       and  A       for  j  =  1,  2r 

j,M+1       M+1,j 
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. ..,  M  are  the  only  states  from  which  the  system 

can  transition  in  one  step  into  state  A. 
Thus,  the  talance  equation  associated  with  state  A  is 


M 

(5.3)      R(£)P(A)  =  p  (n    )  2  r  P  (A  .     ) 

M+1      J         D#M+1 
j=1 

M 

+  2   b  ,r   j*<n  .)P(A     ) 
3  M+1    3     M+1 f j 
j=1 


Using  Gordon  and  Newell* s  [46]  formulation  of  this 
problem,  Gaver  [38]  has  determined  that  the  solution  is 
given  by 


M 

(5.4)  P(A)  =  k  TT  qn' 

J-1  : 


where  K  is  a  constant  which  insures  that  the  appropriate 

normality  condition  is  met  and,  for  j  =  1,2,...,M,  g   = 

i 

b  r    /r  .   Note  from  (5.4)  that 
j  M+1   j 


(5.5)  P(A      )  =  q  .P(A) 

j,M+1      J 


and 


(5.6)  P(A     .)  =  P(A)/g. 

M+1,:  J 
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3.   THE  EVENT-KEYED  SOFTWARE  MONITOR 


Now  consider  attempting  to  monitor  system  performance  by 
means  of  a  software-monitor  job,  hereafter  called  the 
observer,  which  briefly  occupies  the  server  at  the  CPU  each 
time  one  of  the  other  N  jobs  completes  service  at  the  CPU. 
While  at  the  CPU,  the  observer  counts  the  number  of  jobs  at 
each  of  the  M  +  1  processors.   The  time  it  occupies  the 
server  is  assumed  to  be  exponentially  distributed  with  mean 
e  so  small  that  the  disturbance  to  system  operation  is 
minimal.   In  fact,  e  will  be  allowed  to  approach  zero  so 
that  the  possibility  of  a  change  in  the  state  of  the  system 
while  the  observer  is  cccuping  the  CPU  can  be  ignored.   In 
anticipation  of  the  limiting  operation  assume  that 

M+1 

(5.7)  1/e  >>  2   r 

The  observer  is  not  numbered  among  the  N  jobs  in  the  system. 

Note  that  since  the  observer  is  served  at  the  CPU  only 
when  a  job  completes  service  there  and  routes  to  one  of  the 
PP's,  and  since  no  transition  is  allowed  while  the  observer 
is  at  the  CPU,  the  state  in  which  all  jobs  are  at  the  CPU 
cannot  be  observed  by  the  observer.   For  example,  if  N  =  1, 
the  observer  will  always  see  the  CPU  idle.   Hence,  it  is 
clear  that  seme  bias  may  exist  in  the  system  performance  as 
reported  by  the  observer.   The  objective  here  is  to 
determine  this  bias.   Since  no  measures  of  system 
performance  have  been  specified  the  approach  will  be  to 
determine  a  computational  method  for  calculating  the 
steady-state  distribution  seen  by  the  observer  in  terms  of 
the  "actual"  steady-state  distribution  as  specified  in 
(5.4) .   From  these  two  distributions  the  same  measures  of 


235 


system  performance  may  be  calculated  and  compared. 

3.1.   Analysis 

The  difference  between  the  model  discussed  here  and 
the  one  discussed  in  section  2  is  the  introduction  of  the 
observer.   Corresponding  to  each  state  A  of  the  earlier 

model  is  a  state  A.   which  has  the  same  distribution  of  the  N 

jobs  among  the  M  +  1  processors  but  also  has  the  observer  in 

the  system.   This  can  be  depicted  in  vector  notation  by 

addition  of  an  (M+2)-nd  component  having  value  one  if  the 

observer  is  in  the  system  and  zero  if  not.   That  is,  tte 

state  (n  ,n  ,...,n    )  has  n   jobs  (exclusive  of  the 
1   2       M+2        j 

observer)  at  processor  j  for  j  =  1,2,... M+1  and  has  a  total 

of  n     +  n     jobs  (including  the  observer)  at  the  CPU.   By 
M+1     M  +  2 

convention  the  state  name  will  by  "starred"  if  n     =  1. 

M+2 

Thus,  if  A  =  (n  ,n    fc...#n    /0),   then  A   = 

12       M+1 

(n  ,n  ,. . . ,n    ,  1)  . 

1   2       M+1 

Since  the  system  is  different,  the  steady-stare 

distribution  determined  in  section  2  will  not  hold. 

(Indeed,  this  new  system  has  twice  as  many  states.)   So, 

consider  the  balance  equations  for  this  new  system.   The 

steady-stare  probabilities  are  denoted  by  P   indicating  a 

e 

dependence  upon  the  value  of  e. 

Consider  first  an  unstarred  state  A  = 

(n  ,n  ,...,n    ,0).   The  rate  of  transition  from  state  A, 
1   2       M+1 
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R(A),  is  easily  seen  tc  be  specified  fay  (5.2)  and  to  be 
independent  of  e.   The  transitions  described  in  (ii)  are 
still  valid,  but  those  described  in  (iii)  are  not  possible 
since  the  observer  enters  the  system  whenever  a  job  leaves 
the  CPU.   However,  a  transition  into  state  A  occurs  when  the 

observer  leaves  the  system  if  the  system  is  in  state  A  . 

Thus,  the  balance  equation  associated  with  A,  to  be  compared 
with  {5.  3)  ,  is 


(5.8)      B(A)P  (A)  =  0(n    )  5  r  P  (A      )  +  P  (A  )/€ 
e  M+1      j  e   j,M+1      e 

j=1 


New  consider  a  starred  state  A   = 


(n    ,n    ,...,n        ,1) .      Since   the   observer    (and   not   one   of    the 
12  B+1 


N  jobs)  is  ir  service  at  the  CPU, 


M 
* 

(5.9)  fi(A  )  =  1/e  +  2   r  0  (n  ) 

3-1  j   j 


Note  that  R(A  )  does  depend  upon  the  value  of  e.   In  fact, 

by  (5.7)   R(A  )  =  1/e  approximately.   The  transitions 

described  in  (iii)  are  valid  with  A  replaced  by  A  .   Those 

described  in  (ii)  are  similarly  valid  so  long  as  the  A 

j,M+1 

is  replaced  by  A     .   Thus,  the  balance  equation 

j,M+1 

associated  with  A   (compare  again  with  (5.3))   is 


237 


M 

(5.10)      B(A  )P  (A  )  =  0(n    )  5  r  P  (A      ) 

e  M+1     j  e   j,M+1 

j  =  1 

M 


+  y    b  r    0(n  )  P  (A     ) 
j  M+1    j   e   M  +  1,j 
j  =  1  J 


Eefore  proceeding  further  with  the  analysis, 
consider  its  goal:   to  determine  the  bias  introduced  by  the 
observation  method.   This  is  to  be  done  by  calculating  the 
steady-state  probability  distribution  seen  by  the  observer. 
But  this  distribution  is  the  conditional  steady-state 
probability  distribution  given  that  the  observer  is  being 
served  by  the  CPU.   That  is,  the  desired  result  can  be 
determined  by  finding  a  solution  to  (5.8)  and  (5.10)  and 

renormalizing  the  probabilities  of  the  starred  states, 
{P  (A  ) } .   In  principle,  this  can  be  done  by  applying 

€ 

solution  techniques  such  as  those  discussed  in  Chapter  III. 
Eut  (5.7)  makes  this  a  compt uationally  unsatisfying  thing  to 
do.   Also,  the  following  analysis  makes  it  unnecessary. 

Multiplying  (5.8)  through  by  e,  notice  that 


M 
* 

(5.11)      P  (A  )  =  e[R(A)P  (A)  -  0(n    )  "5  r  P  (A      )  ] 
e  e         M+1      j  e   j,M+1 


Since  R(A),  r   and  jZJ  ( n   )  are  all  independent  of  e  and 
j         M+1 

since  P  (A)  and  P  (A     )  are  bounded  above  by  one,  P  (A  ) 
e         e   j/M+1  *= 

approaches  zero  as  e  approaches  zero. 
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Now  using  (5.9)  in  (5.10): 


M 
*  *  * 

(5.12)  P  (A  )/e  =   5r  [  j*  (n  P  (A  )  +  0  (n        )  P  (A      )  ] 

e  D    D  e  M  +  1   e   j,M+1 

M 

+   5  b  r   0{n)  P  (A     ) 
j  M+1     e   M+1,j 
j  =  1 

As  a  result  of  (5.11),  it  is  seen  that  (5.12)  leads  to: 

M 

(5.13)  lira   P     (A    )/e   =      y    b   r        jz$(n   )P     (A  ) 

e  "      j    M+1         j      0      M+1,  j 

e — >0  j=1 

Using  this  result  to  determine  the  limit  in  (5.8) : 

a 

(5.14)  B(A)P  (A)  =  0(n         )    ?    r  P  (A      ) 

0  M+1   "   j  0   j,M+1 

j=1 

M 

+  y   b  r    0  (n  )P  (A     ) 
j  M+1    j   0   M+1,j 
j=1 


Comparing  this  with  (5.3)  note  that  (5.14)   are  the  balance 
equations  for  the  system  described  in  section  2.   These 
results  should  not  be  too  great  a  surprise.   They  say  that, 
as  the  mean  time  the  observer  is  in  the  system  approaches 
zero,  the  probability  of  finding  the  observer  in  the  system 
approaches  zero  and  the  probability  of  finding  the  system  in 
a  state  which  does  not  include  the  observer  (an  unstarred 
state)  approaches  the  probability  of  finding  the  system  in 
that  state  if  the  observer  does  not  exist.   This  limiting 
distribution  is  given  by  (5.4). 

Now  define  a  function  of  the  state  space  of  the 
basic  model  described  in  section  2: 
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(5.15) 


h  (A)  =   lim  P  (A  )  /e 

e 
e  —  >0 


Note  that  h  (A)  =  P  (A  )/e  +  0(e)  so  that  the  h(A)'s  are  a 


scaled  version  of  the  P  (A  )  «s.   That  is,  the  limit  (as  e 

e 

approaches  zero)  of  the  conditional  distribution  may  be 

found  by  normalizing  the  h(A)'s.   Now  using  (5.13),  (5.6) 
and  (5.2)  yields 


M 

(5.16)      h  (A)  *  2   b  r    0  (n  )  P  (A      ) 

j  M+1    j     M+1,j 
3=1 


M 


=  P(A)  ?  b  r   0(n  )/q 

.  ,  3  M+1    3    3 
3=1 


M 


=  E(A)  ?  r  0(n  ) 

3-1  j   j 


=  P(A)[3(A)  -  r    0(n    )  ] 

M+1    M+1 


Dsing  (5.16),  h(A)  can  be  determined  from  the  true 
distribution  (the  P (A) 's)  and  the  system  parameters. 
Normalizing  yields  the  distribution.   The  desired  measures 
of  system  performance  may  then  be  calculated  from  both 
distributions  and  the  results  compared.   Note  that  if  A   = 

(n  ,n  ,...,n   )  where  n   =  n   =  ...  =  n   =0  and  n     =  0, 

1   2       M+1  1     2  M  M+2 

then  R(A)  =  r     and  h  (A)  =  0.   That  is,  this  state  can 
M+1 
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never  be  observed  by  the  observer 


3.2-   Generalizations 

Consider  an  arbitrary  closed  network  of  M  +  1  queues 
and  N  jobs  circulating  among  them.   From  among  the  queues 
designate  one  the  CPU  and  the  others  the  PP's.   Sequentially 
number  the  queues  so  that  queue  M  +  1  is  the  CPU.   Now 
suppose  that  each  state  of  the  system  may  be  represented  as 
a  vector  of  integers  and  thatr  so  designated,  a  finite 
system  of  balance  equations  may  be  found  which  are  satisfied 
by  the  steady-state  probability  distribution  for  the  states. 
That  is,  assume  that  the  system  is  modelled  as  a  finite 
state,  continuous  time  Markov  process.   Examples  of  such 
models  are  found  in  Chapter  II  of  this  thesis  as  well  as  in 
Easkett,  Chandy,  Muntz  and  Palacios  [6]  and  Kelly  [63]. 
Assume  further  xhat  the  steady-state  distribution  is  known 
(or  can  be  determined)  . 

For  convenience,  also  assume  that  two  or  more  cf  the 
following  three  types  of  transitions  may  not  occur 
simultaneously: 

(i)  transitions  resulting  from  one  or  more  jobs 

leaving  the  CfU  destined  for  the  PP's; 

(ii)  transitions  within  the  CPU  (e.g.,  a  job 

changing  from  one  stage  of  service  to  another) ; 

(iii)  transitions  within  and  among  the  PP's  and 

from  the  PP's  to  the  CPU. 

For  each  state  A  define  three  subsets  of  the  state  space: 

C  (A)  =  {B  in  state  space  |  a  single  step 
1 

transition  can  rake  place  from  B  to  A  as 
specified  in  (i)} 
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C  (A)  =  {E  in  state  space  |  a  single  step 

transition  can  take  place  from  B  to  A  as 

specified  in  (ii)  } 
C  (A)  =  {B  in  state  space  |  a  single  step 

transition  can  take  place  from  B  to  A  as 

specified  in  (iii) } 
Because  of  the  assumptions  concerning  occurrence  of 
simultaneous  transitions,  if  a  single  step  transition  can 
occur  from  seme  state  B  into  state  A,  then  B  is  in  exactly 
one  of  C  (A)  ,  C  (A)  and  C  (A) .   The  balance  equation 

associated  with  any  state  A  can  be  written  as: 


(5.17)       B(A)P(A)  =     Z         r(3rA)P(B)  +     2         r(B,A)P(B] 

B€C  (A)  B€C  (A) 

1  2 


+     >    r(3,A)P(B) 

B6C  (A) 
3 


where  P(A)  is  the  steady-state  probability  that  the  system 
is  in  state  A,  r(BfA)  is  the  rate  of  transition  from  state  B 
to  state  A  given  the  system  is  found  in  state  Bf  and  B  (A)  is 
the  total  rate  of  transition  from  state  A  given  the  system 
is  found  in  state  A.   In  analogy  with  (5.2),  R (A)  may  te 
written  as  a  sum  of  r  (A,B)  over  all  states  B  into  which  the 
system  could  transition  from  state  A  in  one  step. 

Now  consider  a  software  monitor,  again  called  the 
observer,  which  enters  the  system  each  time  a  transition 
occurs  in  which  one  or  more  jobs  leave  the  CPU  destined  for 
the  PP's.   Again  assume  that  the  service  distribution  for 
the  observer  at  the  CPU  is  exponentially  distributed  with 
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mean  e.   Alsc  assume  there  is  only  one  observer,  so  that  a 
second  one  cannot  enter  the  system  if  another  transition  as 
described  in  (i)  occurs  while  the  observer  is  at  the  CEU. 
Furthermore,  assume  that  the  observer  cannot  be  preempted, 
cannot  be  blocked,  and  is  subject  to  no  bulking. 

As  in  subsection  3.1,   use  a  star  to  denote  presence 

of  the  observer  in  the  system.   Except  that  P  (A)  will 

e 

denote  the  steady-state  probability  that  the  system  is  in 
state  A,  the  notation  introduced  earlier  in  this  subsection 
will  be  adhered  to.   Note  that  r(A  , B)  and  E  (A  )  are 

dependent  upon  e,  though  r(B,A  )  and  r (B  ,A  )  are  not. 

* 
Also,  let  f  (A  )  be  the  rate  at  which  the  system  transitions 

out  of  A   by  having  the  observer  complete  service  at  the  CPU 

and  leave  the  system  (given  that  the  system  was  in  state 

*  * 

A  ).   For  many  queuing  disciplines,  f(A  )  =  1/s«   For  the  PS 

*  -1 

discipline  f  (A  )  =  [e(n   +1)  1    where  n     is  the  numter  of 

CPU  CPU 

jots  at  the  CPU  in  state  A   exclusive  of  the  observer. 

Assume  that  f  (A  )  increases  without  bound  as  e  approaches 

zero.   Recognizing  that  A   may  not  be  a  valid  state  foe  all 

unstarred  states  A,  define  P  (A  )  =  0  if  A   is  not  valid. 

e 

(For  an  example  of  such  a  case,  suppose  the  CPU  is  a  single 

server,  PCFS  queue  with  generalized  Erlangian  service 
distributions  and  consider  a  state  A  in  which  the  job  in 
service  at  the  CPU  is  in  its  second  stage  of  service.) 
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The  balance  equations  for  the  system  containing  the 
observer  are  given  by 


*     $ 
(5.18)      R(A)P  (A)  =  f(A  )P  (A  )  +     5    r(B,A)P  (B) 

e  e  e 

B€C  (A) 
2 


♦     >    r(B,A)P  (B) 

3€C  (A) 
3 


and,  for  all  A  such  that  A   is  valid, 

(5.19)      B(A  )P  (A  )  =     2  r(B,A  )P  (E) 


B6C  (A  ) 
1 


+      5    r  (B  ,  A  )  P  (3  ) 

e 

■sf.  if. 

3  €C  (A  ) 
2 

+      2  r  (B  ,A  )  P  (B  ) 

e 
*     * 

B  6C  (A  ) 
3 


Dividing  (5.18)  by  f  (A  )  and  passing  the  limit  as  e 

goes  to  zeic,  note  that  ?  (A  )  also  goes  to  zero.   Since 

e 

*  * 

fi  (A  )  is  the  sum  of  f  (A  )  and  rates  which  are  independent  of 

e,  R(A  )  P  (A  )  and  f(A  )  P  (A  )  have  the  same  limit  as  € 
e  e 

approaches  zero.   Since  r(B,A  )  and  r(3  ,A  )  are  independent 

of  e,  this  limit  is  determined  from  (5.19)  to  be: 
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(5.20)       lim  f (A  ) P  (A  )  = 


r(B,A  )P  (B) 
0 


B€C  (A  ) 
1 


Taking  the  limit  in  (5.18)  and  using  (5.20),  the  result  is 

seen  to  be  the  same  as  (5.17)  so  that  P  (A)  and  P (A)  are  the 

0 

same.   The  reason  this  is  true  is: 

(i)  If  A   is  valid,  B  €  C  (A  )  iff  the  system 

transitions  from  B  to  A   by  a  departure  from  the 


CPU.   That  is,  if  there  were  no  observer,  B  would 


be  in  C  (A)  . 
1 

(ii)  If  A   is  not  valid,  the  system  cannot 

transition  into  A  by  a  departure  from  the  CPU  even 

if  there  were  no  observer;  i.e.,  C  (A)  is  empty. 

1 

Define  a  function,  h,  of  the  state  space  of  the  model 

without  the  observer  by: 


(5.21)      h  (A)  =     2         r(3,A)P(B) 

B€C  (A) 


Note  that  normalizing  the  h(A)'s  yields  the  distribution 

seen  by  the  cbserver  if  (and  only  if)  f(A  )  is  constant  for 

given  e  as  a  function  of  A  (such  that  A   is  valid)  .   This  is 

true  in  the  many  cases  in  which  f (A  )  =  1/e,  but  is  not  true 
in  the  case  the  CPU  has  a  PS  queuing  descipline.   However, 
in  that  case  the  desired  result  is  obtained  if  (5.21)  is 
replaced  ty: 
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(5.22)      h  (A)  =  (n    +1)    5   r(B,A)P(B) 

CPU 

BGC  (A) 
1 


since  the  term  involving  e  in  fi  (A  )  is  1/[ e  (n   +1)  ].   Note 

CPU 

that  if  A   is  not  valid,  the  sum  in  (5.21)  and  (5.22)  is 


vacuous  and  h  (A)  =  0.   That  is,  such  states  cannot  be 
observed  by  the  observer. 

4.   THE  HASTY  AND  INFREQUENT  OBSERVER  SOFTWARE  MONITOR 


Suppose  new  that  system  software  is  monitored  by  means 
of  a  software-monitor  job  which  enters  the  system 
infrequently,  waits  for  an  available  server  at  the  CPU, 
briefly  occupies  the  first  available  server,  hastily  counts 
the  number  cf  jobs  at  each  of  the  M  +  1  processors,  and  then 
leaves,  enly  to  return  much  later.   This  monitor  job  is 
hereafter  called  the  hasty  and  infrequent  observer,  or  HIO. 
Assume  that  the  service  times  for  the  HIO  are  exponentially 
distributed  with  mean  e  and  that  times  between  visits  to  tne 
system  (i.e.,  frcm  the  time  it  leaves  the  system  until  it 
next  enters  the  system)  are  distributed  exponentially  with 
mean  1/w.   The  bias  is  examined  in  the  limiting  case  that  e 
and  w  both  approach  zero.   Though  it  is  not  necessary  to  do 
so,  the  case  in  which  e/w  =  1  can  be  borne  in  mind.   The 
approach  is  similar  to  that  exhibited  in  section  3. 
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4.1.   Analysis 

Consider  introduction  of  the  HIO  into  the  Jackson 
model  introduced  in  section  2.   As  in  subsection  3.1,   add  a 
component  to  the  state  vector  and  give  it  a  value  of  one  if 
the  HIO  is  in  service  at  the  CPU  and  zero  if  the  HIO  is  not 
in  the  system.   In  contrast  to  the  observer  discussed  in 
section  3,  the  HIC  can  be  in  the  system  but  net  in  service. 
This  case  will  be  denoted  by  a  two  in  the  (M  +  2)-nd  component 

of  the  vector  .   If  A  =  (n  fn  ,...,n    ,0)  is  a  state,  then 

12       M+1 

*  # 

so  is  A   =  (n  ,n  ,...,n    ,1)  and,  if  n     >  0,  so  is  a   = 
1   2       M+1  M+1 

(n  ,n  ,...,n    ,2)  .   The  notation  A    ,  R  (A)  and  0  (n)  will 
1   2       M+1  i,j 

have  the  same  meaning  it  had  in  subsection  3.1.   Again, 


P  (A)  will  be  the  steady-state  probability  of  state  A  in  the 
e 

system  containing  the  HIO.   P  (A)  is  dependent  upon  both  e 

e 

and  w,  even  though  the  notation  does  not  contain  the  w. 


Consider  any  state  A  =  (n  ,n  ,...,n    ,0) .   The 
1  12       M+1 

system  may  transition  from  state  A  by 

(i)  having  a  job  complete  service  at  one  of  the 

PP's  and  route  to  the  CEU;  or 

(ii)  having  a  job  complete  service  at  the  CPU  and 

route  to  one  of  the  PP's;  or 

(iii)  having  the  HIO  enter  the  system. 
Thus,  the  total  rate  of  transition  from  state  A,  given  that 
the  system  is  in  state  A,  is: 
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M+1 
(5.23)      E  (A)  =  y    r  0  (n  )  +  w 

j-i  j     j 


In  state  A   the  HIO  is  in  service  at  the  CPU.   Thus, 
a  transition  ty  (ii)  has  rate  1/e  and  a  transition  by  (iii) 
is  impossible.   As  a  result,  the  total  (conditional)  rate  of 

* 
transition  ficm  state  A   is: 


(5.24)      fi  (A  )  =  2    E  0(n    )  +  1/e 

j«i  j     j 


If  n    -  0  and  the  HIO  enters  the  system,  it 
E+1 

immediately  receives  service  since  the  CPU  is  idle.   Thus, 

# 
if  n     =  0,  A   is  net  a  valid  state.   So,  suppose  that  n 

M+1  M+1 

>  0.   Then,  since  the  HIO  is  in  the  system,  (iii)  dees  not 

apply  and  the  total  (conditional)  rate  of  transition  from 

# 
state  A   is  : 


M+1 
# 
(5.25)      E  (A  )  =   5  r  0 (n    ) 

3-1  j   j 

These  three  equations  should  be  compared  with  (5.2) 

which  gives  the  total  (conditional)  rate  of  transition  from 

state  A  if  the  HIO  does  not  exist.   Note  that  as  w 

approaches  zero,  H  (A)  from  (5.23)  approaches  the  result  from 

* 
(5.2) .   Also,  eR(A  )  approaches  one  as  e  approaches  zero. 

New  consider  the  balance  equations.   First  consider 
state  A.   Since  the  HIO  enters  the  system  infrequently,  a 
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single  step  transition  into  A  may  take  place  from  any  A 

j,M+1 

or  A      as  though  the  HIO  did  not  exist.   Thus,  the 
balance  equation  associated  with  state  A  will  contain  terms 

* 
similar  to  all  of  those  in  (5.3) .   However,  from  state  A 


the  system  can  transition  into  state  A  by  having  the  HIO 

complete  service  and  leave  the  system.   Thus,  one  term  must 
be  added  and  the  balance  equation  becomes: 


M 

(5.26)      R(A)P  (A)  =  (1/e)  P  (A  )  +  0  (n    )  ?  r  P  (A      ) 

e  e  M+1      j  e   j/M+1 

M 


+   ?  b  r   0(n  )  P  (A     | 
j  M+1    j   e   H+1,i 
j  =  1 


Now  consider  state  A  .   A  transition  into  state  A 

by  having  a  -ob  complete  service  at  PPj  is  valid  if  n    >  0 

M  +  1 

*  * 

and  must  be  frcm  state  A      .   A  transition  into  A   by 

j,M+1 

having  a  ^ob   complete  service  at  the  CPU  is  valid  only  in 

the  case  that  n   >  0  for  some  j  =  1,  2,  ,  M  and  the  job 

j 

which  completed  service  at  the  CPU  was  preventing  the  HIO 

frcm  receiving  service.   That  is,  such  a  transition  must  be 

# 
frcm  a  state  A      such  rhat  n   >  0.   Note  also  that  if  the 

HIC  arrives  hhen  the  system  is  in  state  A,  and  if  n     =  0, 

M+1 
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the  HIO  will  immediately  enter  service  and  the  system  will 


enter  state  A  .   Hence,  the  balance  equation  associated  with 


A   is: 


(5.27) 


R(A  )P  (A  ) 

e 


M 


=  0(n    )  5  r  P  (A      ) 
M+1   ""  j  e   j,M  +  1 
j=1 


M 


+  2   b  r   0(n  )  p  (A     ) 
j  M+1    j   e   M+1,j 
j  =  1 


+  [  1  -  0  (n    )  ]wP  (A) 
M+1     e 


Finally,  if  n     >  0,  consider  state  A  .   A 
M  +  1 

# 
transition  into  state  A   by  having  a  job  complete  service  at 


PPj  is  valid  if  n    >  1  and  must  be  from  state  A      .   The 

M+1  j/M+1 

# 
reason  n     must  be  at  least  two  is  that  A      is  not  a 
M+1  j/M+1 

state  if  n    =1.   (Remember,  the  number  of  jobs  at  the  CPO 
M+1 

#  # 

is  one  less  for  A      than  for  A  .)   A  transition  intc 

j/M+1 

# 
state  A   by  having  a  job  complete  service  at  the  CPU  is 


impossible  since  if  the  HIO  is  enqueued  at  the  CPU  pricr  to 


such  an  event,  it  would  enter  service.   However,  since  n 


M+1 


>  C,  the  system  enters  state  A   if  the  HIO  enters  the  system 
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when  the  system  is  in  state  A.   Thus,  the  balance  equation 

# 
associated  with  state  A   is: 


M 
#     #  # 

(5.28)      R<A  JP  (A  )  =  jz$(n    ^1)  5  r  P  (A      )  +  wP  (A) 

e  M+1        j  e   j/M+1       € 

j=1 


As  in  section  3,  the  distribution  seen  by  the  HIO  is 
conditional  upon  the  HIO  receiving  service  at  the  CPU  and, 
therefore,  is  determined  by  renormalizing  the  probabilities 
associated  with  the  starred  stares.   Hence,  the  final  result 

* 
cannot  change  if,  instead  of  the  P  (A  )'s,  seme  multiple  of 

e 

them  is  considered.   For  each  state  A  of  the  system  without 

the  HIO  (i.e.,  the  system  considered  in  section  2)  ,  define: 


(5.29)  h  (A)  =  P  (A  )/(ew) 

e       e 


If  the  value  cf  h  (A)  can  be  determined  for  each  such  A,  a 

e 

renormalizaticn  will  yield  the  sought-after  distributicn. 
It  will  also  be  convenient  to  define: 


# 
(5.30)  g  (A)  =  P  (A  ) /w 


for  each  state  A  =  (n  ,n  ,...,n    )   such  that  n    >  C. 

12       M+1  M+1 


Note  that  since  P  (A)  is  a  probability,  it  is 

e 

bounded  as  w  and  e  both  approach  zero.   Substituting  (5.30) 

into  (5.28)  and  dividing  through  by  w  yields: 


251 


M 
# 
(5.31)      R(A  )g  (A)  =  />(n    -1)  Z  r  g  (A      )  +  P  (A) 

e         M+1       j  e   j,M+1      e 

j=1 


for  each  state  A  =  (n  ,n  ,...,n    )  such  that  n     >  0. 

1   2       M+1  M+1 

Using  the  principle  of  mathematical  induction,  g  (A)  is  now 

e 

shewn  to  be  hounded  for  each  such  A  as  w  and  e  both  approach 

zero.   First  consider  a  state  A  such  that  n    =  1.   Then, 

M+1 

#  # 

(5.51)  reduces  to  H  (A  )g  (A)  =  P  (A).   Since  fi  (A  )  is 

e       e 

constant  with  respect  to  w  and  e  (see  (5.25))  and  P  (A)  is 

e 

bounded,  g  (A)  must  be  bounded  as  w  and  e  both  approach 
e 

zero.   Now  suppose  that  k  is  any  positive  integer  less  than 


N  and  that  g  (E)  is  known  to  be  bounded  for  every  state  B 
e 

with  k  or  fewer  jobs  at  the  CPU.   Let  A  be  any  state  such 


that  n    =  k  +  1.   Then  for  each  j  =  1,  2,  ...,  M, 
M+1 

g  (A     )  is  bounded  as  w  and  e  both  aporoach  zero  since 
e   j,M+1 

there  are  k  jobs  at  the  CPU  in  each  such  state.   Since 

# 
H(A  )  ,  0 (n    -1)  and  r   are  independent  of  w  and  e,  and 
M+1  j 

since  P  (A)  is  bounded,  (5.31)  says  that  g  (A)  is  bounded  as 
e  e 

w  and  e  tcth  approach  zero. 

Now  by  substituting  (5.29)  and  (5.30)  into  (5.27) 
and  dividing  through  by  w,  it  is  seen  that: 
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M 

(5.32)      efi  (A  )h  (A)  =  e^(n    )  ?  r  h  (A      ) 

e  M+1   ^   j  e   j,M+1 


j=1 


a 


+   5  b  r   £(n  )  g  <A     ) 
j  M  +  1    j   e   M+1,j 
j  =  1 

+  [1  -  j*(n    )  ]P  (A) 

M+1    e 

* 
Since  eR  (A  )  approaches  one  as  e  approaches  zero,  an 

inductive  argument  similar  to  that  used  in  the  last 

paragraph  shews  that  h  (A)  is  bounded  as  w  and  e  approach 

e 

zero  together. 

Substituting  (5.29)  into  (5.26)   yields: 

M 

(5.33)      R(A)P  (A)  =  wh  (A)  +  fi  (n    )  2    *    ?  (A      ) 

e        e         M  +  1      j  e   j,M+1 

M 

+   5  b  r    <t(n   )  P  (A     ) 
j  M+1    j   e   M+1,j 

j  =  1 


Since  h  (A)  is  bounded  as  w  and  e  approach  zero,  the  first: 
e 

term  on  the  right  side  of  (5.33)  vanishes  as  the  limit  is 

taken,  and  in  the  limit  (5.33)  is  identical  to  (5.3).   That 

isr  in  the  limit  P  (A)  approaches  the  P(A)  in  (5.4)  .   Since 

e 

h  (A)  and  g  (A)  are  bounded,  (5.29)  and  (5.30)   indicate 
e         e 

*  # 

that  P  (A  )  and  P  (A  )  both  approach  zero  with  w  and  e.   So 
e  e 

again  it  is  seen  that  as  the  time  between  observations 
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becomes  longer  and  the  duration  of  the  observations  become 
shorter,  the  probability  of  finding  the  HIO  in  the  system 
decreases  toward  zero,  and  the  probability  of  finding  the 
system  in  a  given  state  which  does  not  include  the  HIO  tends 
toward  the  probability  of  finding  the  system  in  that  state 
if  the  HIC  dees  not  exist. 


Since  the  limit  of  P  (A)  as  w  and  e  approach  zero  is 

e 

now  known  to  exist  (it  is  equal  to  P(A)),  the  same  inductive 

arguments  used  above  to  show  that  g  (A)  and  h  (A)  are 

e  e 

bounded  show  that,  for  each  state  A  =  (n  ,n  ,...,n    )  ,  h (A) 

1   2      M  +  1 

exists  and,  if  n    >  0,  g(A)  exists  where: 

M+  1 


(5.34)  h(A)  =   lim    h  (A) 

e 
e,  w — >0 

(5.35)  g  (A)  =   lim    g  (A) 

e 

e,w — >0 


Furthermore,  the  values  of  these  limits  may  be  determined  in 
terms  of  the  P (A)  •  s  and  the  system  parameters  by  first 
recursively  solving  for  the  g(A) 's  using  the  limiting  form 
of  (5.31)  : 


# 
(5.36)      R  (A  )g(A)  =  >zS  (n    -1)  2   r  9  (A  .     )  +  P  (A) 

il+ 1        j    3,  M+1 
j=1 


and   then   sclving   fcr    the   h(A)*s    using    the    limiting    form   of 
(5.32)  : 
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M 

(5.37)      h(A)  =  2   b  r    0(n.)g(A     .) 

;)  M+1    3  M+1,3 

3=1 

+  [1  "  J*(n    )  ]P(A) 
M+1 


The  distribution  seen  by  the  HIO  may  now  be  determined  by 
normalising  the  h(A) 's.   Note  that  h (A)  =0  if  all  of  the 
jobs  are  at  the  CPU  in  state  A.   The  necessary  calculations 
may  now  be  specified  in  the  form  of  an  algorithm: 


Algorithm  for 

Calculating  the  Steady-State  Distribution 

Seen  by  the  HIO 

(i)  Using  (5.4)  with  K  =  1,  calculate  P  (A)  for 

each  state  A  =  (n  ,n  ,...,n   )   such  that  n     = 

12       M+1  M+1 

0.   let  m  =  0.   (The  value  of  m  is  the  number  of 

jobs  at  the  CPU  in  the  states,  A,  for  which  h  (A) 

is  beinc  calculated.) 

(ii)  Using  (5.4)  with  K  =  1,  calculate  P  (A)  for 

each  state  A  -  (n  ,n  ,  ...,n   )   such  that  n     = 

12       M+1  M+1 

m  +  1 . 

(iii)  Using  (5.36),  calculate  g (A)  for  each  state 

A  =  (n  ,n  ,...,n   )  such  that  n     -m+1.   Note 
12       M+1  M+1 

# 
that  the  value  of  S(A  )  should  be  calculated  from 

(5.25). 

(iv)  Using  (5.37),  calculate  h(A)  for  each  state  A 

=  (n  ,n  ,...,n    )  such  that  n     =  m. 
1   2       M+1  M+1 

(v)  Increase  m  by  1.   If  m  <  N,  go  to  (ii)  . 

Otherwise,  set  h  (A)  =  0  where  A  =  (0  ,  0  , . .  .  ,0  ,  N)  . 

(vi)  Set  S   equal  to  the  sum  of  the  P(A)'s  and  S 
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egual  tc  the  sum  of  the  h  (A)  's.   For  each  state  A, 

P(A)/S   is  the  true  steady-state  probability  of 

P 

finding  the  system  in  state  A,  and  h(A)/S   is  the 

h 

steady-state  probability  that  the  HIO  finds  the 
system  in  state  A.   STOP. 


4.2.   Generalizations 

In  the  case  of  the  event-keyed  observer  of  section 
3,  the  cause  cf  bias  was  the  fact  that  the  observer  always 
received  service  immediately  after  a  job  had  routed  from  the 
CPU  to  the  PP's.   In  the  case  of  the  HIO,  this  is  not  always 
true,  although  it  is  true  in  every  case  where  the  number  of 
jots  at  the  CPU  is  no  smaller  than  the  number  of  servers 
there  when  the  HIO  arrives.   That  is,  if  the  number  of 
servers  at  the  CPU  does  not  exceed  the  number  of  jobs  in  the 
system  (and  if  the  CPU  does  not  have  a  PS  queuing 
discipline) ,  the  HIO  cannot  observe  any  state  in  which  all 
of  the  jets  in  the  system  are  at  the  CPU.   In  addition,  bias 
will  be  introduced  if,  in  some  cases,  the  HIO  must  wait 
before  receiving  service.   This  is  true  even  in  the  limit  as 
the  time  between  visits  gets  large  and  the  duration  of  a 
service  interval  gets  small.   The  reason  is  that  the  amount 
of  time  the  HIC  must  wait  (and  thus  the  number  of 
transitions  which  could  occur  in  that  time)  is  independent 
of  the  time  between  visits  and  the  service  time.   Note  that 
if  the  CPU  has  an  IS  or  a  PS  queuing  discipline,  then  no 
bias  exists  (in  the  limit)  since  the  HIO  always  receives 
immediate  service. 

Generalization  of  the  form  of  the  network  by 
allowing  jobs  to  move  from  one  PP  to  another  without  first 
receiving  service  at  the  CPU  complicates  the  problem. 
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Rather  than  solving  for  the  g (A)  's  recursively  as  was  done 
in  the  prededing  subsection,  it  is  necessary  to  solve  a 
series  of  systems  of  linear  equations. 

Finite  capacity  at  the  CPU  raises  guestions 
concerning  the  possibility  of  the  HIO  not  being  allowed  to 
enter  the  system.   It  must  also  be  assumed  that  the  HIO  is 
not  subject  to  bulking  restrictions  and  that  the  HIO  cannot 
be  preempted. 

Generalization  of  the  number  of  job  types,  of  the 
number  of  servers,  of  queuing  discipline  (except  as 
restricted  in  the  preceding  paragraph)  and  of  service 
distributions  (to  generalized  Erlangian  distributions)  can 
all  be  handled  in  an  analysis  similar  to  that  of  the 
preceding  sutsecticn  with  appropriate  changes  to  the  balance 
equations.   Details  of  these  generalizations  will  not  te 
discussed  here  since  the  analysis  is,  in  most  cases,  tedious 
without  beinc  instructive.   In  each  case  the  balance 
equations  must  be  generated  and  it  must  be  assumed  that  the 
solution  is  known  for  the  case  in  which  the  HIO  does  net 
exist.   The  distritution  seen  by  the  HIO  is  then  found  by 
solving  in  seme  recursive  manner  for  the  g(A)'s,  using  these 
to  get  the  h(A)'s,  and  normalizing  the  h(A)'s. 
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VI.   THE  CENTRAL-SERVER  MODEL  AS  A  SUBMODEL 


In  previous  chapters  ways  in  which  the  equilibrium 
distribution  can  be  determined  for  a  wide  variety  of 
central-server  models  have  been  discussed.   Many  potential 
applications  of  this  technology  can  be  modelled  as  a  network 
of  queues  having  a  central-server  model  as  a  major 
subsystem.   Sections  3  and  4  of  this  chapter  present  two 
examples  cf  this  situation.   In  both  sections  the 
central-server  submodel  represents  a  computer  and  the 
remainder  of  the  system  is  "external  to  core." 

In  order  to  be  able  to  apply  the  techniques  of  Chapters 
II  and  III,  certain  assumptions  must  be  made  concerning  the 
relationship  between  the  subsystem  and  the  remainder  of  the 
model.   These  assumptions  are  embodied  in  the  concepts  of 
decomposition  and  aggregation  as  discussed  by  Courtois  [27]. 
These  concepts  are  introduced  in  section  1.   Some  details 
concerning  their  application  to  models  having  central- server 
submodels  are  presented  in  section  2. 


1.   DECOMPOSITION  AND  AGGREGATION 


Consider  the  class  of  models  having  the  general  form 

pictured  in  Figure  6.1,  where  box  A  represents  the  portion 

of  the  queuing  network  external  to  the  central-server 

sutmodel.   The  first  subscript  on  each  a   ,  b   and  c 

ij    ij      ij 

refers  to  a  job  type  and  the  second  subscript  to  the 
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processor  nunter.   The  a    are  the  branching  probabilities 

ij 

of  the  central-server  submodel.   Thus,  for  each  job  type  i, 


M 

?  a   =  1 

d=i  ij 


For  each  i  and  j,  t,   is  the  probability  that  a  type-i  job 

"*■  J 

will  remain  within  the  central-server  subsystem  after 

completion  of  service  at  processor  j,  and  c   is  tne 

ij 
probability  that  it  will  leave  the  subsystem.   Thus,  b    + 

ij 

c   =1  for  each  i  and  j. 
ij 


So  that  use  may  be  made  of  the  steady-state  probability 

distribution  for  the  central-server  model,  it  is  assumed 

that  b    >>  c    for  each  job  type  i  and  each  queue  number  j 
ij     ij 

=  1,  2,  ...,    M+1.   This  assumption  allows  use  of  the  ideas 

of  decomposition  and  aggregation  as  discussed  by  Courtois 
[27].   According  to  Courtois,  a  system  is  decomposible   f: 
i(i)  interactions  within  groups  can  be  studied  as 
if  interactions  among  groups  did  not  exist,  and 
(ii)  interactions  among  groups  can  be  studied 
withcut  reference  to  within  group  interactions. 
In  the  context  of  Figure  6.1   this  means  that  interactions 
between  the  central-server  submodel  and  the  remainder  of  the 
queuing  network  (box  A)  are  so  infrequent,  as  compared  to 
those  within  the  submodel,  that  the  submodel  is  able  tc 
"reach  equilibrium"  between  successive  interactions  with  the 
remainder  of  the  network.   As  a  result  the  steady-state 
distributions  may  be  determined  for  the  central-server 
submodel,  Figure  6.2,   and,  then,  used  to  determine  the 
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Figure  6 . 2--Central-Server  Submodel 
from  General  Network 


properties  of  an  aggregate  queue,  C,  for  use  in  studying  the 
entire  model  with  the  central-server  submodel  replaced  by 
the  aggregate  queue,  Figure  6.3.   A  similar  technique  is 
used  by  Chandy,  Herzog  and  Woo  [21]  in  their  application  of 
Norton's  theorem.   In  practical  terms  the  assumption  that 

b   >>  c    for  all  i  and  j  is  equivalent  to  an  assumption 
ij     ij 

that  jobs  visit  the  CPU  (queue  M  +  1)  many  times  before 
departing  frcm  the  subsystem. 

Actual  inplementation  of  the  method  outlined  above 
would,  in  general,  involve  sufficient  assumptions  concerning 
the  queues  to  allow  use  of  a  product-form  solution,  ala 
Baskett,  et.  al.   [6],  or  numerical  techniques  such  as  those 
discussed  in  Chapters  II  and  III.   In  particular,  so  that 
the  model  in  Figure  6.3  can  be  analyzed,  a  form  of  service 
distribution  must  be  assumed  for  the  aggregate  queue  C.   For 
simplicity,  service  distributions  at  C  are  assumed  to  be 
exponential  with  state-dependent  rates. 

Section  2  examines  more  closely  the  assumed  form  of  the 
aggregate  gueue  C  and  indicates  how  to  determine  the  rates 
frcm  the  steady-state  solution  of  the  central-server  model, 
Figure  6.2.   Sections  3  and  4  contain  examples  of  situations 
in  which  this  technique  could  be  applied  in  models  of 
multiprograirmed  computer  systems.   Section  3  examines  a 
model  in  which  the  tape-mounting  process  is  considered. 
Section  4  considers  models  of  core  allocation  to  jobs  as 
they  enter  cere  from  the  input  queue. 
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2.   AGGREGATION  OF  THE  CENTRAL-SERVER  MODEL 


Consider  the  problem  of  replacing  the  central-server 
submodel  in  figure  6.1  by  a  single  aggregate  queue,  C.   In 
order  to  analyze  the  resulting  model,  as  depicted  in  Figure 
6.3,  the  properties  of  this  aggregate  queue  must  be  either 
kncwn  or  assumed.   It  is  assumed  that  all  service 
distributions  at  queue  C  are  exponential,  though  the  service 
rates  may  te  state-dependent.   In  this  section  these  rates 
are  determined  under  the  assumption  that  interactions 
between  the  aggregate  queue  and  the  remainder  of  the  network 
are  infrequent  enough  that  equilibrium  is  reached  within 
queue  C  between  pairs  of  such  interactions.   A  "conditioning 
-  unconditicEing"  approach  similar  to  that  often  used  in 
renewal  theory  is  used  here. 

Notice  ficm  Figure  6.1  that  any  job  circulating  amcng 
the  queues  of  the  central-server  submodel  could  be  the  next 
job  to  emerge  from  the  submodel  and  enter  the  remainder  of 
the  network  (box  A).   Thus,  the  queuing  discipline  assumed 
for  queue  C  nust  reflect  this  property.   In  their 
applications  of  Norton's  theorem  to  networks  of  queues, 
Chandy,  Herzcg,  and  Woo  [21]  make  the  aggregate  queue  an 
infinite  server  queue  and  "dispense"  the  total  service  rate 
amcng  all  cf  the  jobs  at  the  queue.   A  similar  approach  is 
taken  here,  except  that  the  infinite  server  assumption  is 
misleading  since  box  A  may  may  contain  a  queue  for  C.   This 
is  the  case  in  the  models  discussed  in  sections  3  and  4 
where  the  central-server  submodel  represents  a  computer. 
The  number  of  jobs  "in  the  computer"  is  constrained  by  the 
amount  of  core  and  other  resources  available.   C  is 
therefore  pictured  as  a  multiserver  queue  in  which  the 
number  of  servers  is  state-dependent. 

Since  the  discussion  involves  both  the  model  depicted  in 
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Figure  6.3,  hereafter  called  the  overall  model,  and  the 

central-server  model  depicted  in  Figure  6.2  concurrently,  it 

is  convenient  to  have  notation  which  distinguishes  between 

the  two  models.   Denote  by  W  the  state  space  of  the  model 

depicted  in  figure  6.3.   Consider  any  state  S  in  W .   A 

specification  of  the  number  jobs  of  each  type  in  service  at 

queue  C  must  be  included  as  a  part  of  any  description  of 

state  S.   Because  of  the  assumptions  concerning  the  queuing 

discipline  ard  service  distributions  at  C,  nc  further 

information  ccncerning  the  contents  of  queue  C  is  required 

for  a  description  of  state  S,  though  additional  information 

may  be  reguired  to  specify  the  waiting  line  for  entry  to  C. 

As  seen  in  Chapter  II,  changing  the  composition  of  jobs  in  a 

central-server  model  (that  is,  changing  the  number  of  jobs 

of  one  or  mere  job  types)  changes  the  state  space.   This  is 

true  even  if  the  total  number  of  jobs  does  not  change. 

Denote  the  state  space  of  the  central-server  model  depicted 

in  Figure  6.2  by  Q (S)  when  the  overall  model  is  in  state  S. 

Since  the  composition  of  jobs  at  C  is  specified  by  a 

description  of  S,  the  dependence  of  the  state  space  of  the 

central-server  model  upon  this  composition  of  jobs  is 

expressed  as  a  dependence  upon  the  state  S.   Note  that  it  is 

possible  that  Q{S    )    =    Q(S  )  for  S  *   S  ,  ooth  in  W. 

12        12 

Now  let  S  be  any  element  of  W.   The  service  rate  at  the 
aggregate  queue  C  for  jobs  of  each  type  is  determined  fcy 
conditioning  upon  the  state  (from  Q(S))  of  the 
central-server  model  represented  by  C,  and  then 
unconditicnirg  by  .using  the  steady-state  probabilities  for 
the  central-server  model.   Note  that  the  result  is  still 
conditional  upon  the  overall  system  being  in  state  S.   That 
is,  the  service  rates  at  C  are  state-dependent. 

Let  the  composition  of  jobs  in  the  central-server  model. 
Figure  6.2,  he  that  specified  in  a  description  of  state  S  as 
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the  composition  of  jobs  in  aggregate  queue  C.   Then  the 

state  space  for  the  central- server  model  is  Q (S)  .   Also 

assume  that  the  steady-state  distribution  has  been 

determined,  maybe  by  using  the  techniques  discussed  in 

Chapter  II  or  by  using  a  product  form  solution.   For  each  A 

in  Q  (S) ,  let  P  (A)  denote  the  steady-state  probability  that 
S 

the  central-server  system  is  found  in  state  A. 


Let  A  be  any  state  in  Q  (S) .   From  a  description  of  A  and 

a  specification  of  the  parameters,  service  distributions, 

queuing  disciplines,  and  so  forth  for  the  central-server 

model,  the  rate,  r   ,  at  which  the  system  leaves  state  A  by 

ij 

having  a  type-i  jot  complete  service  at  queue  j  for  each  job 

type  i  and  each  queue  j  =  1,  2,  ...,  M  +  1  (given  that  the 

system  is  in  state  A)  can  be  determined.   The  following 

example  illcstrates  how  r    would  be  calculated  in  a  variety 

ij 

of  situations. 


Example  6.  1 

Suppose  PP 1  contains  two  type-one  jobs  and  three  type- 
two  jobs  in  state  A.   Furthermore,  in  the  cases  in  which  it 
is  a  factor,  the  type-two  jobs  are  known  to  have  all  arrived 
prior  to  the  type-cne  jobs.   (If  it  is  a  factor,  this 
information  would  be  contained  in  the  state  description  for 
state  A.)   Suppose  that  all  service  distributions  at  PF1  are 

exponential  with  service  rate  v   for  type-i  jobs. 

i 

(a)  If  EP1  has  a  FCFS  discipline  and  m  servers, 

then  r    =0  and  r   =  mv   if  m  <  3,  but  r    = 
11  21      2  11 

v  min(2,m-3)  and  r   =  3v   if  m  >  3. 
1  21      2 
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(b)  If  EF1  has  a  preemptive  LCFS  discipline  and  m 

servers,  then  r   =  mv   and  r    =  0  if  m  <  2,  but 

11      1       21 

r    =  2v   and  r    =  v  min(3,m-2)  if  m  >  2. 
11      1       21     2 

(c)  If  PP 1  has  a  preemptive  priority  discipline 
and  m  servers,  the  results  of  (a)  apply  if 
type-twc  jobs  have  priority  over  type-cne  jobs, 
but  the  results  of  (b)  apply  if  type-one  jobs  have 
priority  over  type-two  jobs. 

(d)  If  PP1  has  a  PS  discipline,  then  r   =  2v  /5 

11      1 

and  r    =  3v  /5. 
21      2 

(e)  If  the  CPU  has  finite  capacity  and  can  accept 

no  further  jobs  until  a  job  leaves,  then  r    and 

11 

r    are  as  specified  in  (a)  -  (d)  with  a  replaced 
21 

by  m-k  where  k  is  the  number  of  blocked  servers  at 

queue  1.   In  particular,  if  k  =  m,  r    =0=r 

1  1         21 

New  suppose  that  the  service  distributions  at  PP1  are 

generalized  Erlangian  and  let  v    be  the  service  rate  in  the 

ik 

k-th  stage  and  g    the  probability  of  departing  the  queue 

ik 

after  completing  service  at  the  k-th  stage  for  type-i  jobs. 

(f)  If  PF1  has  a  single  server  and  a  FCFS  queuing 

discipline,  then  r   =0  and  r    =  g   v   if  the 

11  21     2k  2k 

(type-twc)  job  in  service  is  the  k-th  stage  of 
service . 


Now  returning  to  Figure  6.1,  notice  that,  if  the 

central-server  subsystem  transitions  from  state  A  by  having 

a  type-i  job  leave  queue  j,  then  the  probability  that  that 

job  will  leave  the  subsystem  (at  that  time)  is  c   .   Thus, 

ij 
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re    is  the  rate  at  wnich  type-i  jobs  complete  service  at 
ij  ij 

queue  j  and  depart  the  central-server  subsystem  given  that 

the  subsystem  was  in  state  A.   From  this  it  is  seen  that, 

for  each  job  type  i,  the  rate,  R  (A) ,  at  which  type-i  jobs 

i 

depart  the  subsystem  given  that  the  subsystem  is  in  state  A 
is  given  ty: 


M+1 
(6.1)  R  (A)  =  J    r   c 

:=1 


Note  that  although  the  values  of  the  r    are  also 

ij 

conditional  upon  the  central-server  subsystem  being  in  state 
A,  this  condition  has  been  suppressed  in  our  notation. 

Using  the  steady-state  probability  distribution  for  the 
central-server  model  with  state  space  Q(S),  the  condition  on 

state  A  may  new  te  removed  to  determine  the  rate,  T  (S) ,  at 

i 

which  the  overall  system  transitions  from  state  S  by  having 

a  type-i  job  leave  the  aggregate  queue  C  given  that  the 
system  is  in  state  S: 


(6.2)  1  (S)  =    "5    R  (A)  P  (A) 

i  is 

A€Q  (S) 


At  this  pcint  more  precision  is  possible  in  specifying  the 
character  of  the  aggregate  queue  C.   C  is  a  multiserver 
queue  with  exponential  service  distributions.   Both  the 
number  of  servers  and  the  service  rates  are  state-dependent, 
If  the  overall  system  is  in  state  S  (an  element  of  W)  ,  and 
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if   n      is   the    number   of   type-i    jobs   at    C   in    state   S,    then   the 
i 

service  rate  for  each  type-i  job  at  C  is  T  (S)/n   if  n   >  0 

i     i      i 

(and  is  immaterial  if  n   =  0) .   Furthermore,  if  n  is  the 

i 

total  number  of  jobs  at  C  in  state  A,  then  n  is  also  the 
numter  of  servers  at  C,  given  the  system  is  in  state  S. 

In  truth,  the  exact  character  of  C  is  usually 

unimportant  since  the  goal  is  examination  of  the  system 

represented  in  Figure  6.1  by  studying  the  balance  equations 

(or  the  Kcitclgorov  differential  equations)  for  the  overall 

system  as  represented  in  Figure  6.3.   For  these  equations, 

only  the  1  (S)  are  required, 
i 

Computationally  it  is  generally  unnecessary  to  solve  the 

appropriate  central-server  model  each  time  a  new  state  from 

H  is  considered.   Typically  many  states  from  W  will  have  the 

same  composition  of  jobs  at  C.   Hence,  the  procedure  tc 

follow  is:   First,  for  each  feasible  composition  of  jobs  at 

C,  solve  the  corresponding  central-server  model  for  the 

steady-state  probability  distribution.   Use  this 

distribution  in  (6.2)   to  calculate  T  (S)  for  each  job  type 

i 

i,  and  store  the  T  (S) .   Then,  in  developing  the  balance 

i 

eguations  for  the  overall  model,  use  (for  each  S  in  W)  the 

T  (S)  corresponding  to  the  composition  of  jobs  at  C 
i 

appropriate  for  the  state  S  under  consideration.   Storage  of 

the  T  (S)  for  efficient  recall  is  aided  by  the  fact  that  the 
i 

composition  cf  jobs  at  C  can  be  represented  as  a  vector 

(n  ,n  ,...,n  )   of  nonnegative  integers,  where,  for  each  i  = 
1   2      k 

1,  2,  ...,   k,  n   is  the  number  of  type-i  jobs  at  C  in  the 

i 


composition  under  consideration,  and  the  number  of  job  types 

is  k.   The  procedures  of  section  3  of  Chapter  II  can  be  used 
to  sequence  the  vectors  representing  feasible  compositions. 

The  T  (S)  may  be  stored  in  an  array  T  such  that  T(i,j)  is 

i 

T  (S)  for  the  j-th  composition.   For  each  state  S,  it  would 
i 

then  be  necessary  only  to  determine  the  sequence  number  j 

corresponding  to  the  composition  of  jobs  at  C  specified  in 

the  vector  description  of  S.   Procedures  for  doing  this  are 

also  specified  in  section  3  of  Chapter  II. 


3.   THE  TAPE-MOUNT  PROBLEM 


In  most  modern  computer  centers  users  are  able  to  retain 
and  use  privately  owned  storage  space,  data  sets  and  program 
libraries  on  magnetic  tapes  and  disk  packs.   Each  job 
requiring  use  of  such  a  device  must  wait  for  the  appropriate 
number  of  tape  drives  and  disk  drives  to  be  allocated  and 
for  the  tapes  and  disks  to  be  mounted  before  core  can  te 
allocated  for  it.   The  physical  operation  of  mounting  the 
tapes  and  disk  packs  is  performed  by  computer  operators. 
Because  of  the  high  speed  with  which  computers  are  able  to 
handle  most  jobs,  and  because  the  computer  operators  have 
many  other  duties,  the  time  required  to  mount  a  tape  or  disk 
after  the  drive  has  been  allocated  is  often  many  times  the 
time  required  for  the  computer  to  service  a  single  job.   As 
a  result  the  computer  could  become  severely  underutilized 
if,  during  seme  period  of  time,  most  jobs  require  such 
mounts.   This  is  the  tape-mount  problem.   In  this  section 
only  tapes  are  considered  explicitly.   Private  disk  packs 
could  be  handled  the  same  way  or  could  be  lumped  together 
with  tapes  as  a  single  type  of  resource. 
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In  an  effort  to  examine  the  tape  mount  problem 

consideration  is  given  to  the  model  depicted  in  Figure  6.4. 

(Figure  6.4  should  be  compared  with  Figure  6.3.)   Two  types 

of  jobs  transit  through  the  system.   Type-one  jobs  require 

nc  mounts,  but  type-two  jobs  do.   The  box  marked  TM 

represents  the  tape  mounting  procedure;  Q   is  the  line  of 

1 

jobs  waiting  for  drives  to  be  allocated  or  tapes  to  be 

mounted.   C  represents  the  computer  itself  and  is  an 
aggregate  queue  representing  a  central-server  submodel  (see 
Figure  6.1).   Q   is  the  "core-allocation  queue"  where  jobs 

await  servicing  by  the  CPU.   All  mounts  have  been  performed 

for  type-two  jobs  in  Q  . 

2 


The  jobs  at  C  have  been  allocated  core  and  are  currently 
being  serviced  by  the  computer.   The  maximum  number  of  jobs 
at  C  may  be  restricted  to  be  the  maximum  attainable  level  of 
multiprogramming  for  the  computer  under  consideration  (or 
seme  average  level  of  multiprogramming) ,  or  the  ideas 
discussed  in  the  next  section  may  be  utilized  here  to  also 
model  the  core-allocation  procedure.   For  the  present 
discussion  assume  that  there  is  some  maximum  number,  N,  of 
jobs  which  can  simultaneously  be  at  C,  and  that  no 
combination  cf  N  jobs  is  restricted  because  of  core 
allocation.   (Note  that  there  may  be  a  restriction  if  the 
facility  does  not  have  a  sufficient  number  of  tape  drives  to 
allow  N  type-two  jobs  in  the  system  simultaneously,  but  this 
restriction  is  not  because  of  core  allocation.)   Also  assume 
that  each  type-two  job  requires  exactly  one  tape,  and  that  a 
finite  number  L  of  tape  mounts  are  available  to  the  system. 
Thus,  the  total  number  of  type-two  jobs  at  TM,  Q   and  C 

cannot  exceed  1,  though  more  type  two  jobs  might  be  at  Q  . 
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As  a  consequence,  it  is  possible  to  have  TM  empty  even  if  Q 
is  not. 

Before  considering  the  box  marked  D  in  Figure  6.4, 
consider  the  central-server  model  from  which  (ala  section  2) 
C  will  receive  its  properties.   This  model  is  depicted  in 
Figure  6.2.   Since  each  type-two  job  has  its  own  tape,  and 
since  all  tapes  must  be  mounted  before  a  job  can  enter  C, 
there  is  no  queuing  foe  tape  usage  and  no  degradation  cf 
service  tc  jebs  using  tapes  if  another  job  begins  to  use  a 
tape.   Thus,  all  tape  drives  together  may  be  considered  as  a 
single,  infinite  server  queue.   Letting  PP1  be  this  queue, 

a    =0  since  no  type-one  jobs  use  tapes.   (Note  that  the 
1 1 

tapes  model  discussed  in  Chapter  IV  meets  the  description 

given  here.) 

Now  consider  the  box  marked  D  in  Figure  6.4.   This  box 

will  be  modelled  in  different  ways  for  different  models. 

For  example,  if  it  is  assumed  that  there  are  exactly  n 

1 

type-one  jobs  and  n   type-two  jobs  in  the  model  of  interest 

2 

at  every  instant  of  time,  box  D  need  not  exist.   If  the  n 
1  1 

and  n   vary  but  n   +  n   remains  constant,  box  D  may 
2  1     2 

represent  a  "place"  where  in  zero  time  a  job  does  or  dees 

not  change  type  according  to  a  probability  distribution.   On 

the  other  hand,  box  D  could  represent  source  and  sink  with 

jobs  entering  D  disappearing  (leaving  the  system)  and 

periodically  new  jobs  being  created  (read  in)  and  routing  to 

Q   or  Q   according  to  job  type.   If  the  latter  is  the  case, 
1      2 

a  ffaxirauai  nunber  of  jobs  in  the  network  should  be  fixed  to 
keep  the  state  space  finite. 
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Ignoring  the  effect  of  D  on  the  state  description  (if 

any),  and  assuming  exponential  service  at  TM,  a  state  of  the 

system  may  be  represented  as  (t  ,t    ,t    ,x  ,x  , x  )  where 

12   3   12      k 

t   =  the  number  of  type-one  jobs  at  C 
1 

t   =  the  number  of  type-two  jobs  at  C 

t   =  the  number  of  (type-two)  jobs  at  Q   and  TM 
3  1 


combined 


1   z 


,x   =  the  order  of  job  types  at  Q 
k  2 


The  restrictions  to  be  used  in  determining  which  nonnegative 
integer  vectcrs  represent  valid  states  include: 


Rule  1 

Rule  2 

Rule  3 

Rule  4 
system 

Rule  5 

Rule  6 


t  <  N. 
1 

t  +  t   <  N. 

1     2 

t  <  L. 
2 

t  +  t   +  t   <  maximum  number  of  jobs  in 
12     3 


k  =  0  if  t   +  t   <N. 
1     2 

t  +t   ♦  t   +  k  <  maximum  number  of  jobs 
1     2     3 


in  system.   If  the  total  number  of  jobs  is  fixed, 

this  sum  must  equal  the  total  number. 

Rule  7:   The  number  of  twos  in  (x  ,  x  ,...,x  )  < 

1    2       k 

L  -  t  . 
2 

For  each  such  state,  the  number  cf  jobs  in  service  at  TM  is 

the  maximum  possible  given  that  it  aust  be  no  larger  than 

t  ,  no  larger  than  the  number  of  servers  at  TM,  and  no 
3 

larger  than  the  number  of  available  tape  drives.   The  number 
of  available  tape  drives  is  L  minus  the  sum  cf  t   and  the 
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number  of  twos  in  (x  ,x  ,...,x  ). 

1   2      k 


Notice  that  the  effect  of  changing  the  number  of  tape 

drives  may  be  studied  by  varying  L.   The  effect  of  changing 

the  number  of  operators  might  be  studied  by  changing  the 

number  of  servers  at  TM.   The  effect  of  dedicating  operators 

to  the  tape  ncunting  operation  might  be  studied  by 

increasing  the  service  rate  at  TM.   The  service  distribution 

at  TM  could  also  be  modified  to  a  generalized  Erlangian 

distribution  by  adding  another  component  to  the  state  vector 

for  each  server  {see  subsection  2.5  of  Chapter  II)  .   The 

case  of  "bulk  mounting"  can  also  be  modelled  if  an 

additional  cemponent  is  added  to  the  state  vector.   In  bulk 

mounting  the  operators  wait  until  Q   contains  at  least  t 

1 

tapes  for  which  taje  drives  have  been  allocated.   They  then 

mcunt  tapes  until  either  all  drives  are  busy  or  Q   is  empty. 

The  component  added  to  the  state  vector  should  be  one  when 

the  mounting  operation  is  underway  and  zero  otherwise.   It 

changes  frcm  zero  to  one  when  the  number  of  jobs  at  Q   and 

1 

the  number  of  available  tape  drives  both  reach  t  and  from 
one  to  zero  when  one  of  these  two  numbers  drops  to  zerc. 

The  numter  of  tapes  required  by  each  type-two  jot  could 
also  be  generalized.   This  could  be  accomplished  by  using 
type-two  jobs,  type-three  jobs,  ...,   where  type-k  jobs 
require  that  k- 1  tapes  be  mounted. 

Now  consider  another  model  in  which  the  decomposition 
and  aggregation  concepts  are  useful.   Once  again  the 
aggregate  queue  will  be  a  computer  which  can  be  represented 
as  a  central-server  system.   This  time  the  problem  is  cne  of 
core  allocation. 
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4.   THE  CCBE- ALLOCATION  PROBLEM 


One  of  tte  constraining  factors  in  the  level  of 
multiprogramming  of  a  modern  computer  is  the  amount  of 
primary,  cr  core,  storage  available.   How  available  core  is 
allocated  to  the  jobs  awaiting  service  is  an  important 
decision  for  the  managers  of  a  computer  facility.   For 
example,  suppose  a  job  requiring  100K  is  at  the  front  of  the 
queue  and  a  large  number  of  25K  jobs  follow  it.   If  25K 
becomes  available,  should  that  25K  remain  idle  until  100K 
has  accumulated  and  the  first  job  in  the  queue  can  be 
served?   Or  should  the  smaller  jobs  be  allowed  to  bypass  the 
10CK  job  and  utilize  the  available  core?   From  the 
standpoint  of  efficient  utilization  of  the  available 
resources,  the  latter  is  the  course  to  adopt.   However,  in 
an  extremely  busy  computer  center  such  a  policy  could  lead 
to  serious  turnaround  problems  for  larger  jobs  (and 
potential  less  of  goodwill  with  customers  who  are  providing 
a  large  portion  of  the  center's  income).   In  addition, 
searching  long  input  queue  lists  for  a  job  small  enough  to 
fit  into  available  core  can  be  a  costly  overhead  item. 

An  alternative  to  both  the  strict  FCFS  scheme  and  the 
bypass  scheme  is  ore  in  which  the  queue  is  divided  intc  two 
parts.   The  first  k  jobs  (i.e.,  those  which  have  been 
waiting  the  longest)  form  what  will  be  called  the  anteroom 
queue.   All  ether  jobs  form  the  "hallway"  gueue.   Althcugn 
bypassing  is  permitted,  only  the  jobs  in  the  anteroom  queue 
can  be  allocated  core.   Thus,  it  is  necessary  to  search  only 
the  antercom  queue  of  k  jobs  each  time  core  is  made 
available.  A    job  from  the  hallway  queue  is  allowed  to  enter 
the  antercom  queue  each  time  core  is  allocated  to  one  of  the 
k  jobs  in  the  anteroom  queue.   Now  the  question  of  interest 
is:   What  value  of  k.  (what  size  of  anteroom)  will  provide 


276 


reasonable  service  for  large  jobs  without  serious  loss  of 
efficiency?   Models  similar  to  those  presented  in  this 
section  could  be  used  as  a  tool  by  managers  in  determining 
an  answer  to  this  guestion. 

Consider  the  model  depicted  in  Figure  6.5.   As  in 
section  3,  C  is  the  computer  and  the  model  depicted  in 
Figure  .6.2  is  analyzed  (ala  section  2)  to  determine  its 
properties.   Q  represents  the  anteroom  gueue  and  will 
contain  ax  mcst  k  jobs.   Conceptually  box  D  contains  the 
hallway  gueue  and,  perhaps,  a  source  and  sink  for  jcbs  or  a 
job  type  changing  facility.   In  the  models  discussed  here, 
the  contents  cf  D  will  be  represented  by  a  single  number,  if 
at  all. 

Suppose  that  the  computer  has  a  total  of  H  units  of  core 
available  fcr  users'  jobs.   (The  unit  of  storage  could  be  a 
byte,  1K  bytes,  100K  bytes,  a  page,  or  some  other 
appropriate  unit.   M  is  considered  to  be  an  integer,  as  is 
the  number  of  units  reguired  by  a  single  job.)   The  jots  may 
be  classified  according  to  the  number  of  units  of  core 
reguired.   Fcr  example,  suppose  that  type-i  jobs  require  i 
units  of  core  for  i  =  1,  2,  ...,  M.   In  practice  there  may 
be  many  values  of  i  for  which  no  jobs  ever  exist,  and  seme 
other  numbering  scheme  may  be  more  practical.   The  type  of 

each  job  is  determined  from  a  probability  distribution, 

M 

C<3.}.  „• 
l  1=1 

Note  that  it  is  possible  for  there  to  be  core  available 
even  though  the  anterocm  gueue  is  full  (i.e.,  contains  k 
jobs) .   For  example,  the  anteroom  gueue  could  be  filled  with 
large  jobs  when  a  small  job  leaves  C. 

Each  time  a  job  leaves  C,  the  anteroom  gueue  is  searched 
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for  the  first  job  which  will  fit  into  the  available  core. 
If  one  is  fcund,  it  enters  C  and  a  job  from  the  hallway 
queue  joins  the  anterocm  queue  at  the  back.   If  core  is 
still  available,  the  anteroom  queue  is  searched  again.   This 
procedure  continues  until  either  no  core  is  available  or  no 
job  in  the  anteroom  queue  will  fit  into  available  core. 
Frcm  this  description  it  is  important  to  notice  three 
things.'  First,  the  number  of  jobs  at  C  is  variable. 
Second,  two  cr  more  jobs  can  enter  C  simultaneously,  but 
jobs  enter  C  cnly  when  a  job  leaves  C.   And  third,  a  job  can 
simultaneously  enter  the  anteroom  queue  from  the  hallway 
queue  and  enter  C  without  delay  in  the  anteroom  queue. 

One  way  cf  representing  the  states  of  this 
core-allccaticn  model  is  with  a  vector  of  length  M  +  k  +  1: 

(n  ,n  ,...,n  ,t  ,t  ,...,t  ,h).   For  i  =  1,  2,  ...,  H,  n   is 

1   2      H   1   2      k  i 

the  number  of  type-i  jobs  at  C;  for  j  =  1,  2,  ...,  k,  t   is 

J 

the  type  cf  job  in  the  j-th  position  in  the  anteroom  queue; 

h  is  the  number  cf  jobs  in  the  hallway  queue.   Note  that  the 
type  of  a  jot  in  the  hallway  queue  does  not  matter  until 
that  job  enters  the  anteroom  queue.   Thus,  as  jobs  enter  the 

H 

anteroom  gueue,  the  probability  distribution  {q . } . _   is 

applied. 

If  the  model  is  closed,  jobs  leaving  C  immediately  join 

the  hallway  queue.   In  this  case  h  need  not  be  retained  as  a 

part  of  the  state  description  since  it  can  always  be 

computed  from  the  total  number  of  jobs  in  the  system  ( N) , 

the  number  cf  jobs  in  the  anteroom  queue  (usually  k) ,  and 

the  number  cf  jobs  at  C  (the  sum  of  the  n  ' s) .   If  N  is  so 

i 

large  that  there  are  always  k  jobs  in  the  anteroom  queue,  N 
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need  not  be  specified  since  its  importance  is  primarily  to 

indicate  whether  or  not  jobs  are  available  in  the  hallway 

queue  when  a  vacancy  occurs  in  the  anteroom  queue.   If  N  is 

relatively  small  and  there  are  m  <  k  jobs  in  the  anteroom 

queue,  then  t    =t    =...=t   =  0  by  convention. 
m+1     ra+2  k 

If  the  mcdel  is  open,  jobs  leavinq  C  disappear  from  the 
system  and  new  jobs  are  created  (probably  according  to  some 
interarrival  time  distribution)  at  D  and  enter  the  hallway 
queue,  the  anteroom  queue  or  the  computer  depending  upcn  the 
state  of  the  system.   In  this  case  it  is  important  to  retain 
h.   The  potential  exists  in  the  open  model  for  the  entire 
system  tc  empty  cf  jobs.   The  potential  also  exists  for  the 
number  of  jots  to  become  infinite.   Thus,  in  order  to  retain 
a  finite  model,  it  is  necessary  to  restrict  the  size  of  the 
hallway  queue. 

The  chcice  of  core-allocation  scheme  becomes  most 
critical  when  the  system  is  very  busy.   When  there  are  few 
jets  in  the  system,  turnaround  time  for  user  jobs  is  usually 
not  a  problem  and  core  utilization  does  not  approach  1C0£  no 
matter  what  scheme  is  used.   Hence,  even  though  it  may  be 
mere  realistic  tc  consider  an  open  model  than  a  closed  model 
(after  all,  real  systems  do  occasionally  go  idle)  ,  a  closed 
model  (with  ft  large  and  unspecified  and  h  omitted  from  the 
state  vector)  is  probably  a  good  choice  for  investigating 
the  question  posed  above.   Hereafter,  assume  a  closed  system 
with  N  large  enough  (though  unspecified)  to  insure  that 
there  are  always  k  jobs  in  the  anteroom  queue. 

Note  that  (n  ,n  ,...,n  ,t  ,t  ,...,t  )  must  satisfy  the 
12       a   1   2       k 

following  constraints  to  represent  a  valid  state  of  the 
system  descrited  atove: 


280 


Rule  1:   n   >  0  for  i  =  1,  2,  ...,  M. 

i 

Rule  2:   m  =  H  -   ?  in   >  0. 

i 
i=1 

Rule  3:   m  <  t   <  M  for  j  =  1 ,  2,  ...,  k. 

J 

The  number  in  calculated  in  Rule  2  is  the  core  available  for 
other  user  jots.   The  sum  in  that  equation  is  the  total  core 
in  use. 


Let  I  =  (M  +  s)/2  where  s  =  1  if  a  is  odd  and  s  =  2  if  M 
is  even.   Then  since  the  total  number  of  units  of  core 
available  for  user  jobs  is  M, 


M 

2   n   <  1 
i 
i=L 


Hence  the  state  vector  can  be  shortened  by  replacing  the 

ccuponents  n  ,    n    ,  .  ..,  n   by  a  single  component  t  ,  where 
I    L+1         M  0 

t   =  i  fcr  i  =  L,    L  +  1,  ...,  a  if  there  is  a  type-i  job  at  C 
0 

and  t   =  0  if  nc  jcb  requiring  L  or  more  units  of  core  is  at 
0 

C.   If  k  >  M,  the  stare  vector  can  be  further  reduced  by 

replacing  the  components  t,t,...,t   by  s  ,  s  ,  ....  s  , 

12        k      1    2         M 

where,  for  each  i  =  1 ,  2,  . .  . ,  M ,  s   is  the  position  of  the 

i 

first  type-i  ~ob  in  the  anteroom  queue.   If  there  is  nc 

type-i  jcb  in  the  anteroom  queue,  s   =  k  +  1.   With  such  a 

i 

description,  the  type  of  a  particular  job  .would  not  become 

kncwn  until  it  became  the  first  job  of  its  type  in  the 
anteroom  gueue.   To  determine  the  type  of  a  particular  job, 
it  is  necessary  to  use  a  conditional  distribution  derived 
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H 

from  {g  }    .   Tor  example,  if  s   =  1,  s   =  2,  s   =4  and  s 
i  i=1  M       1       3  i 

>  4  for  i  #  1,3,M,  the  third  job  in  the  anteroom  queue  is  of 

type  one  with  probability  g  /  (g  +g  )  and  of  type  M  with 

1    1   M 

probability  g  /(g  +  g  )  .   Note  that  it  cannot  be  of  type 
M    1   M 

three,  for  example,  since  the  first  type  three  job  is  in 
position  four. 


Example  6.2 

Suppose  that  only  two  types  of  jobs,  large  and  snail, 

utilize  the  system,  and  that  the  large  jobs  require  twc 

units  of  core  and  the  small  jobs  only  one.   Assume  that  the 

system  is  busy  enough  than  there  are  always  k  jobs  in  the 

anteroom  gueue.   A  state  of  the  system  can  be  specified  by  a 

vector  (n  ,n  ,s  ,s  )  where  n   is  the  number  of  type-i  "obs 
12   12         i 

in  the  computer  and  s   is  the  position  of  the  first  type-i 
jet  in  the  anteroom  gueue.   Since  only  two  types  of  jobs  are 

under  consideration,  further  simplification  is  possible. 

First,  if  n   is  specified,  n   is  the  largest  integer  less 
1  2 

than  or  egual  to  (M-n  )/2.   If  this  were  not  the  case,  there 

would  be  two  or  more  units  of  core  available  and  a  job  would 

enter  the  computer  from  the  anteroom  queue.   Second,  s   and 

1 

s   are  net  bcth  required.   For  if  s  =   3,    s      =1  and  the  job 
2  12 

in  the  second  position  is  of  type  two.   On  the  other  hand, 

if  s   =1,  the  first  type-two  job  is  in  position  two  with 
1 

probability  c  ,  position  3  with  probability  g  g  ,  ..., 
"2  12 

k-2 

position  k  with  probability  g    g  ,  and  not  in  the  anteroom 

r  ^    2 
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k-1 

queue  with  probability  q    .   Thus,  a  state  is  completely 

specified  by  the  ordered  pair  (n  ,s  )  .   Note  that  at  mcst 

one  unit  cf  cere  can  be  idle,  and  that  this  can  happen  only 

when  there  are  no  type-one  jobs  in  the  anteroom  queue. 

A  program  which  generates  the  states  and  the  balance 
equations  for  this  model  has  been  written  in  FORTRAN.   The 

input  parameters  are  q    ,    M#  k,  and  for  j  =  0,  1,  ...,  H  the 

probability,  EETA(1,j),  that  a  type  one  job  is  next  to  leave 

C  given  that  n  =  j.   The  states  are  sequenced  according  to 

a  lexicographic  ordering  of  the  state  vectors  when  arranged 

as  (s  , n  ) .   With  this  sequencing  procedure  the  coefficient 
1   1 

matrix  is  nearly  triangular.   Thus,  a  direct  elimination 

method  is  used  to  solve  the  balance  equations.   A  weakness 
of  the  program  is  that  it  effectively  assumes  that  the  mean 
time  between  departures  from  C  is  one  time  unit,  independent 
of  the  state  cf  the  system.   This  difficulty  could  be 
overcome  by  letting  B£TA(i,j)  be  the  rate  at  which  type  i 
jobs  leave  C  given  that  n   =  j,  and  making  BETA (2, j)  input 

parameters  too.   The  following  table  lists  the  steady-state 

probability  of  having  an  idle  unit  of  core  as  a  function  of 
the  size  k  cf  the  anteroom.   The  values  of  the  ether 
parameters  are  g   =  .25,  H  =  6  and  BETA(1,j)  =  n  /(n  +n  ). 

(Note  that  a  newly  arriving  job  is  three  times  as  likely  to 

be  a  large  job  as  a  small  one.) 


k        123456789 
Prob.   .35   .29   .24   .21   .19   .17   .15   .14   .13 
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Note  that  since  no  more  than  one  of  the  six  core  units 
available  can  be  idle  at  one  time,  utilization  of  core  is 
better  than  what  appears  to  be  indicated  in  this  table.   For 
example,  with  k  =  3  utilization  of  core  is  at  [1.0  - 
(.24/6)  ]  =  0.S6.   Increasing  k  to  7  increases  utilization 
cnly  to  0.975. 

In  the  first  section  of  this  chapter  the  concepts  cf 
decomposition  and  aggregation  were  introduced,  and 
conditions  under  which  these  concepts  might  be  used  to  aid 
in  analyzing  a  model  which  has  a  central-server  submodel 
were  discussed.   The  second  section  discussed  more 
specifically  how  to  determine  the  properties  of  a  queue 
which  is  an  aggregation  of  the  central-server  submodel. 
Finally,  the  last  two  sections  were  devoted  to  examples  of 
situations  in  which  these  techniques  might  be  applied. 
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VII.   SUMMARY  AND  REQUIREMENTS  FOR  FURTHER  RESEARCH 


This  final  chapter  consists  of  a  brief  review  of  the 
first  six  chapters  with  special  emphasis  upon  areas  in  which 
further  research  and  development  are  needed.   The 
organization  is  by  chapter. 


1.   CHAPTER  I 


Chapter  I,  -after  an  introduction  to  the  motivation 
behind  the  work  that  this  thesis  represents  and  a  preview  of 
the  contents  of  the  thesis,  was  used  to  establish  the 
terminology  and  notation  used  throughout  and  to  present  a 
rather  extensive  review  of  gueuing-network  literature, 
particularly  as  applied  to  computer  applications.   In  the 
literature  review  several  areas  were  pointed  out  where 
further  research  is  reguired. 

One  such  area  concerns  the  eguilibrium  output  process  of 
a  single  gueue.   To  be  specific:    "What  is  the  output 
process  of  a  gueue  having  given  characteristics  (number  of 
servers,  gueuing  discipline,  capacity  and  service 
distributions)  given  a  particular  input  process?"   This 
guestion  is  probably  too  general  to  answer  except  on  a 
case-by-case  basis.   In  view  of  the  variety  of  types  of 
gueues  which  lead  to  satisfaction  of  the  local-balance 
conditions  *hen  incorporated  into  a  network  of  gueues  (see 
the  discussion  of  the  work  of  3askett,  Chandy,  Muntz  and 
Palacios  [6]  in  subsection  5.3   of  Chapter  I),  it  would  be 
natural  to  start  with  cases  in  which  the  interarrival  and 
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service  times  are  generalized  Erlangian,  the  queuing 
discipline  is  FS,  IS  or  LCFSPR,  and  the  capacity  is 
infinite.   The  author  suspects  that  the  "interdepartur e" 
times  will  re  found  to  be  generalized  Erlangian  also, 
perhaps  differing  with  job  type,  and  that  independent 
departure  streams  will  result  from  independent  arrival 
streams.   However,  this  is  purely  conjecture  since  no  work 
has  yet'  teen  attempted  in  this  area.   If  positive  results 
concerning  the  departure  process  can  be  attained  for  queues 
of  this  type  and  also  for  queues  which  lead  to  violation  of 
the  local-balance  conditions  when  incorporated  into  a 
network,  a  comparison  may  shed  new  light  on  fundamental 
differences  ketween  queue  types  which  effect  local  balance; 
or  a  simplified  solution  form,  such  as  a  product  form,  may 
be  detected  for  networks  of  queues  which  do  not  satisfy 
local-balance  conditions. 

This  last  point  leads  to  another  area  where  more  v 
research  is  required.   To  date  (to  the  author's  knowledge) 
there  is  no  set  of  necessary  and  sufficient  conditions  for 
existence  of  a  product-form  solution.   Reversibility  is 
sufficient  but  not  necessary.   Except,  possibly  for  the 
quasi-reversible  models  of  Kelly  [64],  a  product-form 
solution  has  been  exhibited  only  for  networks  satisfying  the 
conditions  of  local  balance.   However,  local  balance  has  not 
been  shown  to  be  either  sufficient  or  necessary,  though  it 
may  be  both.   On  the  other  hand,  it  is  quite  possible  that 
there  are  networks  which  do  not  satisfy  the  local-balance 
conditions  and,  yet,  which  have  a  product-form  solution.   A 
characterization  is  needed  for  networks  which  admit  such  a 
solution. 

Investigation  into  the  relationship  between 
quasi-reversitility  and  the  local-balance  conditions  may 
shed  some  light  in  this  area.   This  relationship  is  not 
currently  known.   Since  the  definition  of 
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quasi-reversibility  given  in  [64]  is  in  terras  of 
independence  of  departure  processes,  it  is  not  clear  whether 
this  concept  can  be  extended  to  apply  to  closed  networks. 
This  is  also  an  area  which  bears  more  research. 

Although  product-form  solutions  of  the  type  discussed  in 
Chapter  I  have  received  much  attention,  the  possible 
existence  of  ether  solution  forms  should  not  be  ignored. 
This  possibility  has  yet  to  bear  fruit.   However,  the  reason 
may  be  that  it  has  not  been  explored. 


2.   CHAPTER  II 


In  Chapter  II  the  problems  associated  with  determining 
the  steady-state  properties  of  a  variety  of  Harkovian 
queuing  networks  were  attacked.   The  topics  covered  include 
representation,  generation  and  storage  of  the  states  of  the 
system  and  generation  and  storage  of  the  balance  equations. 
The  networks  considered  include  multiple  job  types  and 
queues  having  a  variety  of  possible  queuing  disciplines  (as 
defined  in  subsection  4.2  of  Chapter  I),  multiple  servers, 
generalized  Erlangian  service  distributions  and  finite 
capacity . 

The  representation  of  states  of  such  systems  as  integer 
vectors  provides  enough  information  within  the  vector 
representation  to  differentiate  states  from  one  another.   It 
also  leads  to  a  convenient  method  of  generating  the  balance 
equations.   The  concept  of  constrained  lexicographic 
sequencing  of  integer  vectors  then  provides  the  key  to 
generation  of  the  states  and  efficient  storage  of  the 
balance  equations. 

Further  work  could  be  done  in  topics  covered  in  Chapter 
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II  as  other  generalizations  arise.   For  example,  the  concept 
of  bulking  was  introduced  in  subsection  2.7   (of  Chapter  II) 
and  dealt  with  briefly  in  subsections  3.9  and  4.7.   More 
detailed  work  will  be  required  before  bulking  is  actually 
incorporated  into  a  irodel.   However,  there  is  no  need  to 
invest  this  effort  until  an  application  is  found. 

Another  example  is  job-type  switching.   This  could  be 

incorporated  into  the  branching  probabilities  if  switching 

is  allowed  while  a  job  is  in  transit  between  gueues.   For 

example,  p        could  be  the  probability  that  a  type-i  job 
i,j;k,m 

completing  service  at  queue  j  enters  queue  m  as  a  type-k 

job.   This  kind  cf  job-type  switching  is  considered  by 

Baskett,  Chandy,  Muntz  and  Palacios  in  [6], 

As  a  final  example,  such  non-Marxovian  generalizations 
as  correlated  branching  could  be  handled  by  appropriate 
choice  cf  vector  representation.   For  example,  if  the  choice 
of  branching  distribution  from  the  CPU  to  the  PP's  in  the 
central-server  model  depends  upon  which  PP  the  last  jot  to 
leave  the  CPU  routed  to,  the  addition  of  a  component  to 
carry  this  information  leads  to  the  possibility  of  applying 
the  techniques  discussed  in  Chapter  II. 

The  models  discussed  throughout  the  thesis  assume  zero 
transit  time  between  queues.  Poisner  and  Bernholtz  [87,88] 
have  examined  models  which  would  satisfy  the  local-balance 
conditiors  except  that  transit  times  have  arbitrary 
distributions.  The  possibility  of  extending  their  results 
to  more  general  models,  such  as  those  discussed  in  Chapter 
II,  remains  an  open  question  and,  possibly,  a  fertile  area 
for  further  research. 
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3.   CHAPTER  III 


Chapter  III  discussed  a  variety  of  numerical  methods 
which  could  fce  used  to  solve  the  balance  equations  resulting 
from  application  of  the  methodology  presented  in  Chapter  II. 
The  Gauss-Seidel  method,  discussed  in  subsection  4.3   (of 
Chapter  III) ,  is  the  method  chosen  by  the  author.   Reasons 
for  this  choice  have  been  given  in  Chapter  III. 

Many  cf  the  models  encountered  when  the  network  has  a 
central-server  structure  are  cyclic  in  nature.   In  fact,  the 
three  models  which  the  author  has  programmed  have  period  two 
(i.  e.f  they  are  "2-cyclic"  in  the  terminology  of  Chapter 
III) .   Chapter  III  contains  a  proof  that  the  Gauss-Seidel 
method  converges  to  the  steady-state  probability 
distribution  in  these  2-cyclic  cases  so  long  as  the  states 
are  arranged  in  a  rather  natural  order  (the  third  canonical 
form) .   It  is  conjectured  further  that  the  Gauss-Seidel 
method  will  converge  to  this  distribution  no  matter  what  the 
order  of  the  states  (see  the  conjecture  in  subsection  4.3). 

This  conjecture  introduces  an  area  wnich  could  use 
further  research.   Also  included  in  this  area  is  the 
guestion  cf  when  convergence  of  the  Gauss-Seidel  method  can 
be  proved  for  k-cyclic  queuing  models.   It  was  shown  in 
Chapter  III  that  convergence  can  be  proved  if  the  rate 
matrix  has  first  canonical  form,  and  that  lack  of 
convergence  can  be  expected  if  the  rate  matrix  has  second 
canonical  fcim  (unless  k  =  2) .   However,  if  a  canonical 
form,  similar  to  the  third  canonical  form  for  2-cyclic 
models,  is  found  to  arise  naturally  for  the  general  k-cyclic 
model,  convergence  of  the  Gauss-Seidel  method  for  this 
canonical  form  is  still  an  open  question. 

Another  area  which  is  open  for  further  research  is 
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acceleration  of  the  convergence  of  the  Gauss-Seidel  method. 
Twc  possible  methods  for  doing  this  are  suggested  in 
sutsecticn  4.4   (of  Chapter  III):    a  relaxation  procedure 
and  an  "informed"  choice  of  starting  vector.   Neither  method 
has  been  adeguately  explored. 

Other  possible  solution  methods  might  also  bear  further 
examination .   For  example,  specialized  codes  for  solution  of 
linear  prcgramming  problems  have  been  considered.   If  cne  of 
the  balance  equations  is  chosen  to  be  the  objective  function 
and  the  ncrmality  condition  and  the  other  balance  equations 
make  up  tte  constraints,  the  steady-state  probability 
distribution  will  be  the  only  solution  to  the  minimization 
problem.   Unf ortunately,  the  fact  that  this  is  a  square 
problem  (the  number  of  constraints  equals  the  number  of 
variables)  makes  a  linear  programming  approach  a  poor  choice 
given  current  technology.   The  reason  is  that  the  simplex 
method  would  require  inversion  of  the  whole  matrix,  rather 
than  some  relatively  small  submatrices  as  is  usually  the 
case  with  linear  programs.   The  advantages  of  the  iterative 
techniques  described  in  Chapter  III  over  the  direct 
technique  used  in  the  simplex  method  have  already  been 
mentioned.   Many  cf  the  problems  of  interest  are  not  known 
to  have  any  kind  of  special  structure. 

i 

4.   CHAPTER  IV 


Chapter  IV  reported  the  results  of  three  models  which 
have  been  programmed  using  the  methodology  discussed  in 
Chapters  II  and  III.   Two  job  types  are  assumed  by  each 
model,  the  types  being  differentiated  by  their  service  rates 
and  branching  probabilities.   Among  the  queues  represented 
in  the  three  models  are  found  an  IS  queue,  a  PS  queue  and 
both  single-  and  multiple-server  FCFS  queues. 


290 


An  example  in  section  2  (of  Chapter  IV)  showed  the  type 
of  errors  which  could  result  from  making  simplifying 
assumpticns.   Unfortunately,  many  reasonable  models  could 
result  in  unmanageably  large  state  spaces  if  such 
assumpticns  are  not  made.   In  subsection  2.2   ways  of 
approximating  a  network  which  violates  the  local-balance 
conditions  with  one  which  satisfies  them  were  considered. 
The  results  for  the  approximations  considered  were 
discouraging.   Mere  research  is  required  in  this  area. 

Related  to  this  is  work  currently  underway  by  Lehoczky 
and  Gaver.   Ey  switching  to  a  modified  PS  queuing  discipline 
and  adjusting  the  service  rates,  they  have  developed  a 
technique  for  reducing  the  size  of  the  state  space.   In  this 
way  problems  which  would  otherwise  be  too  big  can  be  reduced 
to  a  manageable  size.   Comparisons  between  runs  of  the  FCFS 
program  and  runs  using  their  approximation  method  reveal 
excellent  agreement  in  idleness  probabilities  and  average 
occupancies  at  the  various  queues.   Publication  of  the 
method  and  the  results  is  forthcoming. 

Another  area  where  further  research  is  required  is  the 
inaccuracies  resulting  from  application  of  Norton's  theorem, 
as  discussed  by  Chandy,  Herzog  and  Woo  [21],  to  models  which 
do  not  satisfj  the  local-balance  conditions.  For  example,  a 
Norton's  tnecrem  analysis  could  be  applied  to  a  FCFS  mcdel 
and  tne  results  compared  to  a  run  of  the  same  case  using  the 
FCfS  program. 


5.   CHAPTER  V 


The  possibility  of  bias  in  data  gathered  by  software 
monitors  was  discussed  in  Chapter  V.   The  author  knows  of  no 
other  work  in  which  this  problem  has  been  mentioned,  much 
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less  examined.   In  that  chapter  a  procedure  was  discussed 
for  using  a  queuing  model  to  estimate  the  results  which 
wculd  be  received  from  use  of  two  different  types  of 
software  monitors.   Auxiliary  results  from  application  of 
this  procedure  could  then  be  used  to  measure  the  extent  of 
the  bias.   This  could  be  used  to  help  explain  differences 
between  measures  of  performance  as  determined  using  a 
software  monitor  and  those  determined  using  an  analytic 
queuing  mcdel. 

This  procedure  has  not  yet  been  applied  to  any  numerical 
example.   This  is  an  area  open  for  further  work.   Also, 
ether  models  cf  software  monitors  have  yet  to  be  examined  in 
this  context. 

However,  a  potentially  mere  important  area  of  research 
is  the  dual  cf  the  question  considered  in  Chapter  V:    "How 
are  true  system  parameters  and  measures  of  performance  to  be 
determined  frcm  software  monitor  data  which  is  known  tc  be 
biased?"   This  question  has  not  been  addressed.   On  the 
other  hand,  a  related  question  can  be  attacked  using  the 
procedure  given  in  Chapter  V  (assuming  the  queuing  model 
used  is  an  accurate  representation  of  the  system  modelled) . 
It  is;    "What  is  the  effect  of  using,  in  a  queuing  model, 
system  parameters  estimated  from  data  gathered  by  a  biased 
software  monitor?"   Much  research  remains  to  be  done. 


6.   CHAPTIR  VI 


In  Chapter  VI  some  potential  uses  of  the  central-server 
model  as  a  submodel  were  discussed.   In  general,  the  method 
discussed  involves  replacing  the  central-server  submodel 
with  a  single  aggregate  queue  having  state-dependent 
exponential  servers.   Determination  of  the  rates  of  service 
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at  this  aggregate  queue  are  based  upon  the  steady-state 
solution  of  the  central-server  submodel  for  each  of  the 
feasible  job  mixes. 

As  two  examples  of  cases  in  which  this  technique  might 
be  useful,  the  tape-mount  problem  and  the  core-allocation 
problem  were  discussed.   The  former  problem  involves 
consideration  of  the  time  spent  by  operators  in  mounting  the 
tapes  required  by  user  jobs.   The  latter  problem  involves 
consideration,  and  possible  comparison,  of  different  methods 
for  deciding  which  of  the  various  jobs  waiting  in  the  input 
queue  of  a  computer  will  be  allowed  entry  when  core  becomes 
available.   The  two  job  types  considered  require  different 
amounts  cf  cere.   Thus,  the  problem  becomes:    "When  a  small 
amcunt  of  core  beccmes  available,  should  a  job  requiring 
that  amount  cf  core  be  allowed  to  bypass  earlier-arriving 
jobs  which  require  more  core  than  is  currently  available?" 

The  application  of  the  concepts  of  decomposition  and 
aggregation  to  models  in  which  a  central-server  model, 
representing  a  computer,  is  a  submodel  has  only  been 
outlined  in  Chapter  VI.   The  author  knows  of  no  earnest  work 
in  this  area.   The  core-allocation  problem  discussed  here  is 
particularly  ripe  for  development  and  extension. 
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APPENDIX  A 


THEOEETICAL  DEVELOPMENT  OF  THE  EALANCE  EQUATIONS 


In  this  appendix  theoretical  considerations  are  outlined 
which  lead  tc  development  of  the  systems  of  linear  equations 
whose  solution  yields  the  steady-stare  probabilities  for  the 
queuing  models  of  interest. 

Assume  that  the  system  of  interest  can  be  described  in 
terms  of  a  finite  number,  say  N,  of  states,  and  number  the 
states  frcm  cne  to  N.   Assume  further  that  time  is  measured 
from  a  point,  t  =  0,  at  which  a  change  in  the  state  of  the 
system  occurs. 

Define  a  random  variable  X (t)  to  be  the  state  of  the 

system  at  time  t.   By  convention  if  a  transition  occurs  at 

time  t,  X  (t)  is  the  state  resulting  from  the  transition,  not 

the  state  pricr  to  the  transition.   Also,  define  P   (t)  to 

ij 

be  the  probability  that  the  system  is  found  in  state  j  at 
time  t  given  that  it  starts  in  state  i.   That  is, 


(A.1)      P   (t)   =   Prob{X(t)  =  j  |  X(0)  =  i} 

Define  epochs,  t   =  0  <  t   <t   <  ...,  to  be  the  times 

0         1     2 

at  which  transitions  occur.   Note  that  X(t)  =  X (t  )  if  t   < 

i      i 

t  <  t    .   Then  the  random  variable  T   =  t   -  t     is  the 
i+1  i     l    1-1 
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time  between  epochs  i  -  1  and  i.   It  is  also  the  length  of 

time  that  the  system  stays  in  state  X (t    )  after  the 

i-1 

(i-  1)  -th  epcch. 


If  X(t.   )  =  k,  let  F  (x)  be  the  cumulative  distribution 
1-1  k 

function  for  T  .'  Assume  that  (for  k  =  1,  2,  ,  N)  F  (0  +  ) 

i  k 

=  0  and  that  I   depends  only  on  the  state  of  the  system 

i 

between  the  epochs  i-1  and  i.   That  is,  T   is 

i 

stochastically  independent  of  i,  each  t  ,  T   for  j  *  i  and 

J    D 

X(t  )  for  j  #  i  -  1. 

j 

Consider  a  network  of  queues  in  which  at  least  two  jobs 
can  receive  service  simultaneously.   Then,  completion  of 
service  of  a  job  will  very  likely  result  in  a  change  in  the 
state  of  the  system.   (Naturally,  hypothetical  examples  can 
be  devised  in  which  this  is  not  the  case.   However,  in  the 
more  general  cases  of  interest  here,  this  statement  holds.) 
But,  if  a  job  completes  service  while  another  job  is 
receiving  service,  future  time  between  epochs  will  be 
affected,  net  by  the  distribution  of  service  time  of  this 
latter  job,  but  by  a  conditional  distribution  of  remaining 
service  time.   That  is,  future  epochs  may  net  be  independent 
of  history.   However,  an  elegant  theory  resulting  from 
renewal  arguments  cannot  be  applied  unless  some  guarantee 
can  be  offered  that  the  independence  assumptions  are 
satisfied.   If  all  service  times  (or  at  least  the  times 
between  epochs)  are  exponentially  distributed,  these 
guarantees  begin  to  appear  since  the  residual  lifetime 
distributions  are  identical  to  the  service  distributions. 
This  is  the  so-called  raemoryless  property  of  the  exponential 
distribution.   So  the  aim  in  modelling  is  the  determination 
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of  states  such  that  the  time  between  epochs  is  distributed 
exponentially. 

So,  assume  that  a  system  has  been  modelled  such  that, 
for  each  i  =  1,  2,  ...,  N, 


(A. 2)      F  (x)   =   1  -  exp[-c  (t)x] 

i  i 


where  c  (t)  >  0  depends  upon  the  age  t  of  the  process  (i. 
i 

e.#  the  time  since  t  =  0)  but  not  the  time  x  since  the 

system  last  entered  state  i.   Note  that  this  choice  for 

F  (x)  satisfies  the  assumption  that  F  (0+)  =  0. 
i  i 

Furthermore,  the  probability,  S   (t,t+h),  that  the  system  is 

ij 

found  in  state  j  at  time  t  +  h  given  that  it  is  in  state  i 
at  time  t  approaches  zero  if  i  #  j,  and  one  if  i  =  j,  as  h 
approaches  zero.   Since  the  probability  that  the  system 
transitions  from  state  i  to  other  states  and  back  to  state  i 
again  in  time  h  goes  to  zero  as  h  approaches  zero, 

S   (t,t  +  h)  is  approximately  equal  to  exp[-c  (t)h]  for  small 
ii  i 

h  and 


(A. 3)      lim   [1  -  S   <t,t  +  h)  ]  /  h   =   c  (t) 

ii  i 

h— •  >0 


This  is  assumption  1  of  Chapter  XVII,  section  9  of  Feller 
[18]. 

Now,  define  q   (t)  to  be  the  probability  that,  if  the 
ij 
system  changes  state  from  state  i  at  time  t,  it  will 

transition  into  state  j;  or,  succinctly, 
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(A. 4)     g.  (t)   =   Prob{X(t  )  =  j  |  X  (t    )  =  i  and  t   =  t} 
ID  k  k-1  k 


Note  that  g   (t)  depends  upon  the  age  of  the  process  but  not 
lj 

the  time  the  system  is  in  state  i.   Assuming  that  g   (t)  is 

ij 
continuous, 


(A. 5)      lim  [S   (t,t  +  h)  /  h] 

ij 
h— >0 


lim  [g   (t)F  (h)  /  h] 

ij    i 
h— ->0 


c.  (t)g   (t) 

i    ij 


so  long  as  i  *  j.   This  is  Feller's  assumption  2  (Chapter 
XVII,  section  9  of  [18]). 

Based  upcn  these  two  assumptions  (and  a  third  assumption 
which  is  trivially  satisfied  here  since  the  state  space  is 
finite)  feller  derived  the  system  of  forward  differential 
eguations; 


(A. 6)      a)S       (s,t)/o)t   =   -c  (t)S   (s,t) 
ik  k    ik 


+  I      S.  .(s,t)c  .  (t)  g   (t) 

j#k 


This  system  cf  eguations  leads  to  the  system  cf  linear 
eguations  whose  solution  yields  the  steady-state 
probabilities. 

Note  that  if  s  is  fixed  at  zero, 
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(A. 7)      S.  .<0,t)   =   ?rob{X(t)  =  j  I  X  (0)  =  i}   =   P   (t) 

13  ij 


Hence,  the  system  (A. 6)  becomes: 


(A. 8)      B"  <t)   =   -c  (t)P   (t)    +  2      P   (t)c  (t)g   (t) 
ik  k    lk  ij    j     jk 

j#k  J 


Suppose  that  there  is  some  probability  distribution 

N 
{v  }     for  the  state  of  the  system  at  time  zero.   That  is, 
i  i=1 

for  each  i,  let  v   =  Prob[X(0)  =  i}  .   Then,  multiplying 

i 

(A. 8)  by  v   and  summing  over  i: 
i 


(A. 9)      P'(t)   =   -c  (t)P  (t)   +  I      P.(t)c  (t)q   (t) 

k  k    k  D     1     lk 

j*k 


where  P  (t)  is  the  sum  over  i  of  the  product  P   (t)  v  .   This 
k  ik    i 

is  not  to  say  that  the  probability  that  X  (t)  =  k  is 

independent  of  the  state  X{0),  but  rather  that  the 

unconditional  distribution  P  (t)  =  Prob{X(t)  =  k}  satisfies 

k 

the  same  differential  equation  that  the  conditional 
distribution  satisfies. 


N 
Now,  assume  that  a  steady-state  distribution  (P  } 

k  k  =  1 

exists.   Then,  as  t  approaches  infinity,  P  (t)  approaches  P 

k  k 

and  P«  (t)  approaches  zero.   P   is  the  long  run  average 
k  k 
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percentage  of  time  that  the  system  is  found  in  state  k.   If 

it  is  furthei  found  that  c  (t)  has  a  limit  c   and  q   (t)  has 

k  k      jk 

a  limit  g  ,    then  (A. 9)   becomes  the  system  cf  linear 
jk 

equations,  called  the  balance  equations: 


(A. 10)      c  P    =   5   c  q   P 

*  k     ~    J  jk  j 

D#k 


From  the  interpretation  given  above  for  the  distribution 

{P  }  it  is  seen  that 
k 


(A. 11) 


M 

5    P 
] 
k=l 


This  is  called  the  normality  condition 
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APPENDIX  B 


ALGORITHMS  OF  CHAPTER  II 


This  appendix  contains  the  algorithms  referred  to  in 
Chapter  II  and  is  not  intended  to  be  read  separately  from 
Chapter  II.   Most  of  the  algorithms  are  written  with  FORTRAN 
prcgrammiEg  in  mind.   Many  of  them  are  accompanied  by 
examples  which  demonstrate  their  use  step  by  step.   In  the 
examples  Bccan  numerals  in  parentheses  refer  to  step  numbers 
of  the  corresponding  algorithm.   Since  many  of  the 
algorithms  involve  vectors,  it  is  appropriate  to  remark  thar 
the  components  of  a  vector  are  numbered  from  left  to  rignt 
by  conventicr.   Thus,  the  first  component  is  the  leftmost 
component  and  the  last  component  is  the  rightmost. 


B.1.   ALGORITHM  1:    LEXICOGRAPHIC  SEQUENCING  OF  INTEGER 
VECTORS 


Algorithm  1  is  for  the  lexicographic  sequencing  of  all 
ncnnegative  integer  vectors  satisfying  a  collection  of 
constraints . 

A  nonnegative  integer  vector  is  simply  a  vector  all  of 
whose  components  are  nonnegative  integers.   The  algorithm 
will  work  egually  well  on  other  types  of  vectors  so  long  as 
there  is  a  cne-to-one  mapping  from  the  set  of  feasible 
values  for  the  components  into  the  set  of  nonnegative 
integers . 
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The  constraints  are  simply  a  set  of  rules  which  provide 
seme  restriction  on  the  values  the  components  may  take  on. 
It  is  assumed  that  the  components  have  been  numbered  so  that 
the  first  application  of  any  given  component  may  be  made  at 
such  a  tine  that  reference  need  be  made  only  to  the  values 
of  components  to  the  left  of  the  component  affected.   For 
example,  a  constraint  that  the  value  of  the  first  component 
must  be'  egual  to  the  sum  of  the  values  of  the  second  and 
third  components  will  not  explicitly  affect  the  value  of  the 
first  component.   Rather,  the  second  component  will  be 
restricted  to  be  less  than  or  equal  to  the  first,  and  the 
third  component  will  be  restricted  to  be  egual  to  the 
difference  between  the  first  two.   (For  simplicity,  "the 
i-th  component"  will  often  be  used  to  mean  "the  value  of  the 
i-th  component"  when  the  context  permits.) 

Throughout  this  algorithm,  k  and  m  refer  to  component 
numbers;  n  is  a  counter  which,  at  any  point  in  application 
of  the  algorithm,  indicates  the  number  of  vectors  which  have 
been  enumerated  to  that  point.   All  vectors  are  assumed  to 
have  fixed  length  K. 

Algorithm  1 
(i)  Let  k   =    1 ,    n  =  0  and  m  =  K. 

(ii)  For  each  value  of  k  from  its  current  value  through 
k  =  K,  let  the  k-th  component  egual  its  smallest  possible 
value  subject  to  the  constraints.   If  this  cannot  be  dene, 
go  to  (v)  . 

(iii)  Increase  n  by  1.   The  current  vector  is  the  n-th 
vector. 

(iv)  If  component  m  is  at  the  largest  value  it  can  have 
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and  still  satisfy  the  constraints,  go  to  (v)  .   Otherwise, 
increase  component  m  to  its  next  larger  feasible  value.   If 
m  =  K,  go  to  (iii) .   Otherwise,  let  k  =  m  +  1,  let  m  =  K, 
and  go  to  (ii) . 

(v)  Decrease  m  by  1.   If  m  >  0,  go  to  (iv)  .   Otherwise, 
all  vectors  satisfying  the  constraints  have  been  enumerated, 
There  are  n  such  vectors.   STOP. 


Example  E-1 

Problem:   Enumerate  all  nonnegative  integer  vectors  of 
length  K  =  3,  whose  first  component  is  larger  than  the  sum 
of  the  other  two  components,  and  the  sum  of  whose  compcnents 
is  no  larger  than  5. 

Solution : 

(i)  k  =  1 ;  n  =  0;  m  =  K  =  3. 

(ii)  k= 1 :   Since  each  of  the  second  and  third 
compcnents  can  be  no  smaller  than  zero,  and  since 
the  first  ccmponent  must  be  larger  than  the  sum  of 
the  ether  two,  the  first  component  must  be  set 
egual  tc  one. 

k=2,3:   The  other  two  are  set  egual  to  zero. 

(iii)  n  =  1;  the  first  vector  is  {1,0,0}. 

(iv)  The  third  component  is  at  the  only  value  it 
can  have  and  still  satisfy  the  constraint  that  the 
sum  cf  the  second  and  third  components  be  less 
than  the  first. 

(V)   ID  =  2. 

(iv)  The  second  component  also  cannot  be  changed 

withcut  violating  the  constraints. 

(v)  m  =  1. 

(iv)  Ccmponent  1  is  increased  to  2;  k  =  m+ 1  =  2 ;  m 
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=  K  =  3. 

(ii)  k=2,3:   Eoth  the  second  and  third  components 

are  set  equal  to  zero. 

(iii)  n  =  2;  the  second  vector  is  {2,0,0}. 

(iv)  The  third  component  is  set  equal  to  1. 

(iii)  n  =  3;  the  third  vector  is  {2,0,1}. 

(iv)  No  change. 

iv)  ■  =  2. 

(iv)  Component  2=1;  k  =  3;  m  =  3. 

(ii)  Component  3=0. 

(iii)  n  =  4;  {2,1,0}  . 

(v)  No  change. 

(v)  it  =  2. 

(iv)  No  change. 

(v)  m  =  1. 

(iv)  Component  1  =  3 ;  k  =  2 ;  m   =  3. 

(ii)  k=2,3:   components  2  and  3=0. 

(iii)  n  =  5;  {3,0,0}  . 

(iv)  Component  3=1. 

(iii)  n  =  6;  {3,0,1}  . 

(iv)  Component  3=2. 

(iii)  n  =  7;  {3,0,2}  . 

(iv)  No  change. 

(v)  m  =  2. 

(iv)  Component  2  =  1;  k  =  3;  m  =  3. 

(ii)  k=3:   component  3=0. 

(iii)  n  =  8;  {3,1,0}  . 

Repeated  application  yields: 
n=  9    {3,1,1}      n  =  10    {3,2,0}      n=11    {4,0,0} 
n=12    {4,0,1}      n  =  13    {4,1,0}      n=14    {5,0,0} 

Note,  for  example,  that  {4,0,2}  is  not  a  valid  vector 
since  the  sua   of  the  components  exceeds  5.   Also,  note  that 
if  the  braces  and  commas  are  dropped,  the  vectors, 
considered  as  three-digit  numbers,  are  in  numerical  order. 
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This  is  characteristic  of  lexicographic  ordering  of 
nonnegative  integer  vectors.   Algorithm  2  provides  a  method 
of  creating  a  three-digit  number  from  a  nonnegative  integer 
vector  of  length  three. 


B.2.   ALGORITHM  2:    TRANSLATION  OF  A  VECTOR  OF  LENGTH  K 
INTO  A  K-CIGIT  NUMBER 


Algorithm  2  is  a  computationally  efficient  method  cf 
translating  a  vector  of  length  K  into  a  K-digit  number.   It 
is  assumed  that  each  component  of  the  vector  is  a 
nonnegative  integer  less  than  ten.   Throughout  the  algorithm 
k  refers  to  a  component  number  and  m  is  used  to  accumulate 
the  K-digit  rumber. 

Algorithm  2 
(i)  Let  o  =  0. 

(ii)  For  each  value  of  k  from  one  through  K,  multiply  m 
by  10  and  add  the  value  of  component  k  to  get  a  new  value 
for  m . 

Example  E-2 

Prcblem:   Translate  {1/0,5,5,9}  into  a  five-digit  number. 

Solution : 

(i)  d  =  0. 

(ii)  k=1:   m  =  0x10  +1=1 

k=2:   m  =  1x10  +  0  =  10 

k=3:   m  =  10x10  +  5  =  1 05 
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k=4:   m  =  105x10  +  5  =  1055 
k=5:   m  =  1055x10  +  9  =  10559 

Using  different  base  number  systems,  the  restriction 
that  the  digits  te  smaller  than  ten  can  be  relaxed.   Fcr 
example,  using  100  in  the  place  of  10,  the  vector  {10,9,89} 
would  be  translated  into:   100988. 

Using  a  mcd  function,  this  procedure  can  be  reversed  to 
translate  a  given  number  intc  the  corresponding  vector  if 
the  number  tase    is  given. 


B.3.   ALG0RI1B.M  3:    STORAGE  OF  RIGHT  SUBVECTORS 


Algorithm  3  efficiently  stores  the  right  subvectors  in 
an  array  KSTG  without  generating  and  translating  them.   Each 
sutvector  is  stored  as  a  number  (as  though  Algorithm  2  had 
been  applied  to  the  vector  representations) .   Recall  that 
each  right  sutvector  is  a  nonnegative  integer  vector  of 
length  M,  the  number  of  PP's,  and  that  the  sum  of  the 
components  of  each  right  subvector  cannot  exceed  N,  the 
total  number  of  jobs  circulating  in  the  system.   The 
algorithm  requires  the  use  of  two  auxiliary  vectors,  NSTG1 
and  NSTG2,  each  of  length  N.   The  numbers  stored  in  KS1G  may 
be  separated  into  groups  according  to  the  sum  of  the  digits. 
Since  the  zerc  group  contains  only  the  number  zero,  it  is 
not  stored.   NSTG1  contains  pointers  to  the  beginning  cf 
each  group;  and  NSIG2  contains  pointers  to  the  end  of  each 
grcup.   Thus,  NSTG2(i)  =  NSTG1  (i+1)  -  1. 
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Algorithm  3 

(i)  Let  HSTG1<1)  =  1 ,  and  for  k  =  2,3,. ..,N,  let 

NSTG1  (k)  =  NSTG1(k-1)  +  C 

d-2+k,k-1 

(ii)  For  k  =  1,2,. ..,N,  let  KSTG  (NSTG 1  (k) )  =  k  and 
NS1G2  (k)  =  NSTG1(k).   Let  m  =  10  and  k  =  2. 


(iii)  Increase  NSTG2(1)  by  1,  and  let  KSTG (N3TG2  ( 1) )  = 


m. 


(iv)  Let  i  =  NSTG2(k)  +  1  and  j  =  NSIGl(k-l). 

(v)  Let  KSTG(i)  =  K3TG(j)  +  m . 

(vi)  If  j  >  NSTG2(k-1),  go  to  (vii) .   Otherwise, 
increase  i  and  j  each  by  1  and  go  to  { v)  . 

(vii)  Let  NSTG2(k)  =  i.   If  k  >  N,  go  to  (viii) . 
Otherwise,  increase  k  by  1  and  go  to  (iv) . 

(viii)  If  NSTG2(1)  +  1  <  NSTG1  (2) ,  multiply  m  by  10,  let 
k  =  2,  and  co  to  (iii) .   Otherwise,  all  right  subvectors 
have  been  stored  in  KSTG.   STOP. 

Example  E-3 


Prcblem:   Stcre  the  right  subvectors  in  a  case  with  S 
jobs  and  'A   =  3  PP's. 


=  3 


Solution : 

(i)  NSTG1  (1)  =  1,  NSTG1  (2)  =1  +  3=4,  NSIG1  (3) 

4  +  6  =  10. 

(ii)  KSTG(1)  =  1,  flSTG2(1)  =  1 
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KS1G  (4)  =  2,  NSTG2  (2)  =  4 

KSTG(10)  =  3,  NSTG2(3)  =  10,  m  =  10,  k  =  2 
(iii)  NSTG2(1)  =  2,  KSTG(2)  =  10 
(iv)  i  =  5,  j  =  1 
(v)  KSTG(5)  =  1  +  10  =  11 
(vi)  i  =  6,  j  =  2 
(v)  KSTG(6)  =  10  +  10  =  20 
(vi)  no  changes 
(vii)  NSTG2(2)  =  6,  k  =  3 
(iv)  i  =  11,  j  =  4 
(v)  KSTG(11)  =  2  ♦  10  =  12 
(vi)  i  =  12,  j  =  5 
(v)  KSTG(12)  =  21 
(vi)  i  =  13,  j  =  6 
(v)  KSTG(13)  ~    30 
(vi)  no  changes 
(vii)  NSIG2(3)  =  13 
(viii)  ffl  =  1C0,  k  =  2 
(iii)  NSTG2(1)  =  3,  KSTG  (3)  =  100 
(iv)  i  =  7,  j  =  1 
(v)  KSTG(7)  =  101 
(vi)  i  =  8,  j  =  2 
(v)  KSTG(8)  =  110 
(vi)  i  =  9,  j  =  3 
(v)  KSTG  (9)  =  200 
(vi)  no  changes 
(vii)  NSTG2  (2)  =  9,  k  =  3 
(iv)  i  =  14,  j  =  4 
(v)  KSTG  (14)  =  102 
(vi)  i  =  15,  j  =  5 
(v)  KS1G(15)  =111 
(vi)  i  =  16,  j  =  6 
(v)  KSTG(16)  =  120 
(vi)  i  =  17,  j  =  7 
(v)  KSTG (17)  =  201 
(vi)  i  =  13,  j  =  8 
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(V)  KSTG(18)  =  210 

(vi)  i  =  19,  j  =  9 

(v)  KSTG  (19)  =  300 

(vi)  no  changes 

(vii)  NSTG2(3)  =  19 

(viii)  SlOP 
The  resulting  KSTG  vector  is  {1,  10,  100,  2,  11,  20,  101, 
11C,  200,  3,  12,  21,  30,  102,  111,  120,  201,  210,  300}. 
NS1G1  =  {1,  4,  10}  points  to  the  beginning  cf  each  group  and 
NSTG2  =  {3,  9,  19}  points  to  the  end  of  each  group. 


B.4.   ALGORITHM  4:    DETERMINATION  OF  STORAGE  LOCATION  IN 
LSTG 


Algorithm  4  allows  deter  ruination  of  the  storage  loaction 

of  a  given  left  subvector  from  the  vector  representation 

itself.   This  precludes  the  necessity  cf  translating  the 

left  subvectcr  into  a  number  (say,  using  Algorithm  2)  and 

doing  a  search  through  LSTG.   Recall  that  N   is  the  number 

1 

of  ones  and  N   the  number  of  twos  in  the  left  subvector. 
2 

Throughout  the  algorithm  j  refers  to  the  component  numbers 

of  the  left  subvector,  m  refers  to  locations  in  LSTG,  and  i 
is  the  value  cf  the  j-th  component  of  the  left  subvector. 


Algorithm  4 


(i)  Let  L   =N   -  1,  L   =N,m=1  and  j  =  1 
11        2     2 


(ii)  Let  i  be  the  value  of  the  j-th  component  of  the 
left  subvectcr. 
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(iii)  If  i  =  2,  go  to  (iv) .   Otherwise,  decrease  L   by 
1.   If  L   is  negative,  go  to  (vi)  .   Otherwise,  go  to  ( v)  . 


(iv)  Increase  i  by  C        .   Decrease  I   by  1. 

L  +L  ,L  2 

1   2   1 

If  L   is  positive,  go  to  (v)  .   Otherwise,  go  to  (vi)  . 
2 


(v)  Increase  j  by  1.   Go  to  (ii) . 

(vi)  The  current  value  of  m  is  the  desired  location 
number.   STOP. 

Example  B-4 


Problem:   With  N   =2  and  N   =2  find  the  location  of 

1  2 

{1,2,2,1}  in  LSTG.   (See  Example  2.9  in  Chapter  II.) 


Solution: 

(i)  I   =  1;  L   =  2;  m  =  1;  j  =  1. 

1       2 

(ii)  i  =  1. 

(iii)  L   =  0. 
1 

(v)  j  =  2. 

(ii)  i  =  2. 

(iv)  m  =  1  +  1  =  2;  L   =0. 

2 

(v)  j  =  3. 

(ii)  i  =  2. 

(iv)  m  =  2  +  1  =  3;  L   =0. 

2 

(vi)  the  location  is  m  =  3.   STOP. 
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B.5.   ALGORITHM  5:    STORAGE  OF  VARIABLE-LENGTH  LEFT 
SUEVECTORS  IN  ISTATZral  ORDER 


Algorithm  5  is  for  storage  of  the  variable-length  left 
subvectors  when  the  states  are  ordered  according  to  a 
lexicographic  ordering  of  the  ISTATEml  vector 
representation.   Throughout  the  algorithm  k  is  the  length  of 
the  subvectors  being  stored,  m  is  the  location  in  LSTG  of 
the  last  subvector  stored,  i  is  used  as  an  index,  j  points 
to  the  end  of  a  block  of  stored  subvectors  having  common 
length,  and  NO  is  a  scratch  array  used  to  store  the  number 
of  ones  in  the  vector  stored  in  the  corresponding  position 
of  LSTG.   While  storing  the  left  subvectors  in  LSTG,  the 
algorithm  stores  pointers  in  NSTG2.   At  completion  of  the 
algorithm  component  NSTG2  (k)  +  1  of  LSTG  will  contain  the 
first  vector  and  component  NSTG2(k+1)  the  last  vector  of 
length  k.   L  is  the  number  of  servers  at  the  CPU. 

Algorithm  5 

(i)  Let  k  =  max(1,N-L).   This  is  the  smallest  number  of 

digits  in  any  number  to  be  stored  in  LSTG.   For  i  =  1,  2, 

...,  k,  let  NSTG2(i)  =  0.   Also,  let  m  =  0.   Use  Algorithm  1 

to  seguentially  generate  all  vectors  of  length  k  consisting 

solely  of  ones  and  twos  with  no  more  than  N   ones  and  N 
1  1  2 

twos.   Each  time  a  new  vector  is  generated,  increase  m  by 

one,  translate  the  vector  into  a  k-digit  number,  and  store 
the  number  in  LSTG  (m)  .  Also,  store  in  NO  (m)  the  number  of 
ones  in  LSTG  (m) .   Let  i  =  0. 

(ii)  Increase  k  by  one.   Store  the  value  of  m  in 
NSIG2  (k)  .   If  k  =  N,  both  LSTG  and  NSTG2  are  complete. 
STOP.   Otherwise,  let  j  =  m  and  proceed. 
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(iii)  Increase  i  by  one.   Let  a  =  LSTG  (i)  and  b  =  NO  (i)  . 
If  b  =  N  ,  go  to  (iv) .   Otherwise,  increase  m  by  one  and  let 

LSlG(m)  =  a  x  10  +  1.   If  k  is  less  than  N,  let  NO  (m)  =  b  + 
1.   Otherwise,  let 

Otherwise,  proceed. 


1.   Otherwise,  let  NO  (id)  =  b.   If  k  -  b  =  N  ,  go  to  (v)  . 

2 


(iv)  Increase  m  by  one  and  let  L3TG(m)  =  a  x  10  +  2. 
Let  NO(m)  =  t. 

(v)  If  i  is  less  than  j ,  go  to  (iii) .   Otherwise,  go  to 
(ii)  . 


For  an  example  see  Example  2.11(a)  in  Chapter  II. 


B.6.   ALGORITHM  6:    DETERMINATION  OF  NSTG4  FROM  NSTG1  AND 
NSIG2 


Algorithm  6  is  for  determination  of  the  values  required 
in  the  vector  NSTG4  from  the  values  already  stored  in  NSTG1 
and  NSTG2. 

Algorithm  6 

(i)  Let  i  =  1  and  k  =  max(0,N-L).   If  k  =  0,  let 
NSTG4(1)  =  1.   Otherwise,  let  NSTG4(1)  =  NSTG2(k+1)  - 
NS1G2  (k)  . 

(ii)  Let  a  =  NSTG1(i+1)  -  NSTG1(i).   Let  k  =  max(i,N-L) 
and  increase  i  by  one.   Let  NSTG4  (i)  =  a  x  [NSTG2(k  +  1)  - 
NSTG2  (k)  ], 
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(iii)  If  i  <  N,  go  to  (ii) .   Otherwise,  NSTG4  is 
complete.   STOP. 


Example  E-5 


Problem;   A  case  with  L  =  N   =  a  =  2  and  N   =  3  is 

1  2 

considered  ir  Example  2.11  of  Chapter  II.   From  part  (a)  it 

is  seen  that  NSTG1  =  {1 ,  3,  6,  10#  15,  21}  and  NSTG2  =  {0, 
0,  0,  7,  17,  27}.   Determine  NSTG4. 


Solution: 

(i)  i  =  1;k  =  3;  NSTG4(1)  =7-0  =  7. 

(ii)  a  =  3-1  =  2;k  =  3;i  =  2;  NSTG4(2)  =  2x7+7 

=  21. 

(ii)  a  =  3;  k  =  3;  i  =  3;  NSTG4(3)  =  42. 

(ii)  a  =  4;  k  =  3;  i  =  4;  NSTG4  (4)  =  70. 

(ii)  a  =  5;  k  =  4;  i  =  5;  NSTG4  (5)  =  120. 

(iii)  NSTG4  is  complete.  STOP. 


B.7.   ALGORITHM  7:    STORAGE  OF  VARIABLE-LENGTH  LEFT 
SUEVECTORS  IN  ISTATEm2  ORDER 


Algorithm  7  is  for  storage  of  the  variable-length  left 
subvectors  when  the  states  are  ordered  according  to  a 
lexicographic  ordering  of  the  ISTATEm2  vector 
representation.   Throughout  the  algorithm  r  is  the  location 
in  LSTG  where  the  next  subvector  is  to  be  stored,  s  is  the 
location  in  NSTG3  where  the  last  number  was  stored,  and  t  is 
the  length  of  the  vectors  being  stored  (also,  the  location 
in  NSTG2  where  the  last  number  was  stored) .   For  each  = 
NSIG3  (s)  is  a  location  in  LSTG  where  either  the  number  of 
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digits,  the  number  of  ones,  cr  both  change.   For  each  t 
NS1G3 (NSIG2 (t) )  is  the  location  in  LSTG  of  the  first 
sutvector  of  length  t.   Past  (ii)  i  is  the  location  in  LSTG 
of  the  subvector  to  be  considered  next  for  generating 
another  subvector.   Control  is  maintained  in  the  algorithm 
with  the  aid  cf  j  and  m. 

Algorithm  7 

(i)  Set  NSTG2(N)  =  1,  NSIG3(1)  =  1,  r  =  1,  s  =  2  and  t  = 
N  -  1  . 

(ii)  In  lexicographic  order  generate  all  vectors  of 

length  N  consisting  of  N   ones  and  N   twos.   Each  time  a  new 

1  2 

vector  is  generated,  translate  it  to  a  number,  store  the 

number  in  LS1G  (r) ,  and  increase  r  by  1 .   Set  NSTG3  (s)  =  r. 

(iii)  Set  NSTG2  (t)  =  s,  m  =  s  and  j  =  NSTG2(t+1).   Set  i 
=  NSTG3(j)  and  increase  j  by  1. 

(iv)  If  LSTG(i)  is  even,  set  LSTG(r)  =  [  LSTG  (i) /10  ], 
where  [a]  is  the  largest  integer  less  than  cr  equal  to  a, 
and  increase  r  by  1.   In  either  case,  increase  i  by  1 . 

(v)  If  i  <  NSTG3(j),  go  to  (iv)  .   If  NSTG3(s)  =  r,  go  to 
(vi)  .   Otherwise,  increase  s  by  1,  and  set  NSTG3  (s)  =  r. 

(vi)  If  j  >  m,  go  to  (vii) .   Otherwise,  increase  j  by  1, 
and  go  tc  (iv)  . 

(vii)  Set  i  =  NSTG3(j-1). 

(viii)  If  LSTG  (i)  is  odd,  set  LSTG  (r)  =  [LSTG (i) /1 0], 

and  increase  r  by  1.   In  either  case,  increase  i  by  1. 
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(ix)  If  i  <  NSTG3(j),  go  to  (viii)  .   if  NSTG3(s)  =  r ,  go 
to  (x).   Otherwise,  increase  s  by  1,  and  set  NSTG3 (s)  =  r. 

(x)  If  t  <  max(1,N  -L)  ,  go  to  (xi)  .   Otherwise,  decrease 
t  hy  1 ,  and  cc  to  (iii) . 

(xi)  If  LSTG(r)  has  been  set,  increase  r  and  s  each  by 
1,  and  set  NSTG3(s)  =  r. 


(xii)  If  I  >  N   -  1,  go  to  (xiii) .   Otherwise,  set 
1 

NSIG2(t)  =  r  for  t  =  1,2,. ..,(N  -L-1). 


(xiii)  The  procedure  is  complete.   STOP. 


Example  E-6 


Problem:   Store  the  left  subvectors  in  ISTATEm2  order  for 

the  case  that  N   =2,  N   =2,  L  =  3  and  ALEA(1,1)  =  0. 

1        2 


Solution:   (Steps  in  which  no  variable  changes  value  are  not 
listed.) 

(i)  NSTG2(4)  =  1;  NSTG3  (1)  =  1 ;  r  =  1  ;  s  =  2 ;  t  = 

3. 

(ii)  (Use  Algorithms  1  and  2) 


LSTG(1)  =  1122 

1S1G(2)  =  1212 

ISIG(3)  =  1221 

LSZ6(4)  =  2112 

LSTG(5)  -  2121 

ISIG(6)  -  2211 


r  =  2 

r  =  3 

r  =  4 

r  =  5 


r  =  6 

r  =  7;  NSTG3(2)  =  7. 

(iii)  NSTG2(3)  =  2;  m   =    2;    j  =  1  ;  i  =  1;  j  =  2. 

(iv)  Since  LSIG(1)  is  even,  LSTG (7)  =  112;  r  =  8; 


314 


1  =  z. 
iv)  LS1G  (8)  =  121;  r  =  9;  i  =  3. 
iv)  i  =  4  (LSTG  (3)  is  odd)  . 
iv)  LSTG  (9)  =  211;  r  =  10;  i  =  5. 
iv)  i  =  6. 
iv)  i  =  7. 

v)  s  =  3;  NSTG3  (3)  =  10. 
vii)  i  =  1. 
viii)  i  =  2. 
viii)  i  =  3. 

viii)  LSTG{10)  =  122;  r  =  11;  i  =  4. 
viii)  i  =  5. 

viii)  LSTG  (11)  =  212;  r  =  12;  i  =  6. 
viii)  LSTG (12)  =221;r=13;i=7. 
ix)  S  =  4;  NSTG3  (4)  =  13. 
x)  t  =  2. 

iii)  NSTG2(2)  =  4;  in  =  4;  j  =  2;    i  =  7;  j  =  3. 
iv)  LSTG(13)  =  11;  r  =  14;  i  =  8. 
iv)  i  *  9. 
iv)  i  =  10. 

v)  s  =  5;  NSTG3(5)  =  14. 
vi)  j=4. 

iv)  LSTG(14)  =  12;  r  =  15;  i  =  11. 
iv)  LSTG(15)  =  21;  r  =  16;  i  =  12. 
iv)  i  =  13. 

v)  s  =  6;  NSTG3(6)  =  16. 
vii)  i  =  10. 
viii)  i  =  11. 
viii)  i  =  12. 

viii)  ISIG(16)  =  22;  r  =  17;  i  =  13. 
ix)  s  =  7;  NSTG3  (7)  =  17. 
x)  t  =  1. 

iii)  NSTG2(1)  =  7;  a  =  7;  j  =  4;  i  =  13;  j  =  5 
iv)  i  =  14. 
vi)  j  =  6. 
iv)  LSIG(17)  =  1;  r  =  18;  i  =  15. 
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(iv)  i  =  16. 

(v)  s  =  8;  NSTG3  (8)  =  18. 

(iv)  LS1G(18)  =  2;  r  =  19;  i  =  17. 

(vi)  j  =  7. 

(v)  =  =  9;  NSTG3  (9)  =  19. 

(vii)  i  =  16. 

(viii)  i  =  17. 

(xiii)  STOP. 


3.8.   ALGORITHM  8:    DETERMINATION  OF  RELATIVE  POSITION  OF  A 
FAR  SUBVZCTOR 


Algorithm  8  is  for  determination  of  the  relative 
position  of  a  far  subvector  among  all  far  subvectors  which 
are  valid  for  given  right:  and  left  subvectors.   MAX  is  a 
vector  which  gives  the  maximum  values  which  the  component  of 
the  far  subvector  may  take  on.   That  isr  the  k-th  component 
of  the  far  subvector  can  be  no  larger  than  MAX  (k) . 
Throughout  the  algorithm  n  is  used  to  accumulate  the 
relative  position  of  the  given  far  subvector  and  k  and  m  are 
used  to  direct  control  of  the  algorithm.   Recall  that  MAX(k) 
=  1  by  cocvection  if  there  is  no  job  corresponding  to  the 
k-th  component  of  the  far  subvector.   This  will  be 
illustrated  in  the  example  following  the  algorithm. 

Algorithm  8 
(i)  Set  n  =  1  and  k  =  1. 

(ii)  Set  m  egual  to  one  less  than  the  value  of  the  k-th 
ccaponent  of  the  given  far  subvector.   If  m  =  0  or  k  =  K,  go 
to  (iv)  . 


316 


(iii)  For  i  =  k+1,  k+2,  ...,  K,  multiply  m  by  MAX(i). 

(iv)  Increase  n  by  m  and  k  by  1.   If  k  <  K,  go  to  (ii) 
Otherwise,  n  is  the  desired  number.   STOP. 

Example  E-7 


Problem:   Suppose  N   =  N   =  M  =  2.   Further  suppose  that 

1     2 

each  queue  has  a  single  server  and  a  FCFS  queuing 

discipline.   Finally,  suppose  that  the  service  distributions 
are  two-stage  Erlang  distributions  for  type-one  jobs  and 
three-stage  Erlang  distributions  for  type-two  jobs  at  each 
queue.   Determine  the  relative  position  of  {1,2,3}  among  all 
far  subvectors  which  are  valid  for  a  left  subvector 
{1,2,1,2}  and  a  right  subvector  {0,1}. 


Solution:   Since  there  are  no  jobs  at  PP1,  MAX  (2)  =  1  ty 
convention.   Since  a  type-one  job  is  in  service  at  PP2, 
MAX  (2)  =  2.   Since  there  is  a  type-two  job  in  service  at  the 
CPD,  MAX  (3)  =  3. 

(i)  n  =  1;  k  =  1. 

(ii)  m  =  0. 

(iv)  n  =  1;  k  =  2. 

(ii)  m  =  1. 

(iii)  i  =  3:   m  =  1  x  3  =  3. 

(iv)  n=1+3=4;k=3. 

(ii)  m  =  2. 

(iv)  n  =  4  +  2  =  6;  k  =  4;  this  is  the  sixth  far 

subvectcr  for  the  given  left  and  right  subvectors. 

(Note  that  the  first  five  are:   {1,1,1}/  {1,1,2}, 

{1,  1,3}  ,  {1,2,1}  ,  and  {1,2,2}  .) 
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B.9.   ALGORITHM  9;    DETERMINATION  OF  STATE  NUMBER  OF 
VECTORS  WITH  A  FAR  SUBVECTOR 


Algoritha  9  is  fcr  determination  of  the  state  number 
from  the  vector  representation  of  a  state  when  that  vector 
representation  has  a  far  subvector.   This  algorithm  uses 
KSTG  (in  which  the  right  subvectors  are  stored) ,  LSTG  (in 
which  the  left  subvectors  are  stored)  and  NSTG4.   NSTG4  is 
defined  by:   NSTG4  (i)  is  the  total  number  of  states 
preceding  tie  first  state  having  KSTG(i)  as  its  right 
subvector.   Throughout  the  algorithm  i  indicates  the 
location  in  KSTG  at  which  the  current  right  subvector  is 
stored,  j  indicates  the  location  in  LSTG  at  which  the  left 
subvector  is  stored,  k  indicates  a  location  in  LSTG,  and  n 
is  used  tc  accumulate  the  state  number.   The  meaning  of  MAX 
is  given  in  the  discussion  of  Algorithm  8. 

Algorithm  9 

(i)  Let  i  and  j  be  the  integers  such  that  KSTG(i)  is  the 
right  subvector  and  LSIG(j)  is  the  left  subvector  of  tne 
given  state.   Let  k  be  the  smallest  integer  such  that 
LS1G  (k)  is  a  valid  left  subvector  for  the  given  right 
subvector.   Let  n  =  NSTG4  (i)  . 

(ii)  Form  the  MAX  vector  to  accompany  KSTG(i)  and 
LSTG(k).   If  k  =  j,  go  to  (iv)  .   Otherwise,  increase  n  by 
the  product  cf  the  components  of  the  MAX  vector. 

(iii)  Increase  k  by  1,  and  co  to  (ii)  . 

(iv)  Use  Algorithm  8  to  determine  the  relative  position 
of  the  far  subvector  of  the  given  state  among  all  far 
subvectors  which  are  valid  for  KSTG  (i)  and  LSTG  (j)  . 
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Increase  n  by  this  number.   The  value  of  n  is  the  state 
number  of  the  given  state.   STOP. 

Example  E-8 

Problem:   Consider  the  system  discussed  in  Example  B-7. 
Find  the  state  number  for  the  state  whose  vector 
representation  has  right  subvector  {0,1},  left  subvectcr 
{1,2,1,2}  and  far  subvector  {1,2,3}. 

Solution:   KSTG  =  {1,  10,  2,  11,  20,  3,  12,  21,  30,  4,  13, 
22,  31,  40}  and  LSTG  =  {1122,  1212,  1221,  2112,  2121,  2211}. 
Each  left  subvector  can  be  used  with  each  right  subvector. 
Although  all  fourteen  values  in  NSTG4  could  be  determined, 
only  NSTG4(1)  is  needed.   The  only  states  preceding  the 
first  state  having  KSTG(1)  as  right  subvector  are  those  with 
all  jobs  at  the  CPU.   From  LSTG  it  is  seen  that  three  of 
these  states  have  a  type-one  job  in  service  and  three  have  a 
type-two  job  in.  service.   The  feasible  far  subvectors  for 
each  left  subvector  are  {1,1,1},  {1,1,2}  and,  if  a  type-two 
job  is  in  service,  {1,1,3}.   Thus,  NSTG4(1)  =  3x2    *    3x3  = 
15. 

(i)  i  =  1;  j  =  2;  k  =  1 ;  n  =  15. 

(ii)  MAX  =  {1,2,3};  n  =  15  +  6  =  21. 

(iii)  k  =  2. 

(ii)  MAX  =  {1 ,2,3}. 

(v)  Ey  Example  3-7  the  relative  position  of  the 

far  subvector  is  6.   Thus,  the  state  number  is  n  = 

21  +  6  =  27.   STOP. 
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B.10.   ALGCBI1HM  10:    CALCULATION  OF  STATE  NUMBER  FOR 
STATES  WITH  EIOCKING  SUBVECTOR 


Algorithic  10  is  for  determination  of  the  state  number 
from  the  ISTATEml  or  ISTATEm2  vector  representation  when 
this  vector  representation  includes  a  blocking  subvectcr. 
As  usual,  KSIG  contains  the  right  subvectors  and  LSTG,  the 
left  subvectors.   NSTG4(i)  is  the  number  of  states  preceding 
the  first  state  having  KSTG(i)  as  right  subvector.   In  this 
algorithm  i  is  the  number  of  blocking  subvectors  associated 
with  the  given  rignt  subvector,  and  j  is  the  position  cf  the 
given  blocking  subvector  among  them. 

Algorithm  10 

(i)  Find  the  integers  r  and  s  such  that  the  right 
subvector  is  KSTG  (r)  and  the  left  subvector  is  LSTG  (s)  .   Let 
t  be  the  integer  such  that  LSTG(t)  is  the  first  valid  left 
subvectcr  for  KSTG  (r)  .   (In  the  case  considered  in 
subsection  3.8  of  Chapter  II,  t  =  1.)   Let  k  equal  the  sum 
of  the  components  and  m  the  number  of  nonzero  components  of 
the  right  subvector. 

(ii)  If  k  =  N  -  C,  go  to  (iii) .   Otherwise,  set  i  =  1 
and  j  =  1 .   Go  to  (iv)  . 

(iii)  Let  b  be  the  binary  number  which  corresponds  to 

a 
the  (shortened)  blocking  subvector.   Set  i  =  2   and  j  =  1  + 

the  decimal  equivalent  of  b. 

(iv)  The  state  number  is  NSTG4  (r)  +  ix(s  -  t)  +  j. 
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Example  E-9 

Problem:   In  the  case  that  N   =  N   =  c  =  2  and  M  =  3  find 

1     2 

the  state  number  of  the  state  whose  ISTATEml  vector 
representation  is  {2,  1,  0,  1,  2,  1 ,  1,  2,    1,  0,  0}  . 

Solution:   In  this  case  KSTG  =  {2,  11,  2C,  101,  110,  200,  3, 

12,  21,  3C,  102,  111,  120,  201,  210,  300,  4,  13,  22,  31,  40, 

103,  112,  121,  130,  202,  211,  220,  301,  310,  400}.   Note 

that  10,  for  example,  is  not  included  in  KSTG  since  th€ 

number  of  jots  at  the  PP ' s  can  be  no  smaller  than  N  -  C  =  2. 

Also,  LS1G  =  {1122,  1212,  1221,  2112,  2121,  2211}.   For  tne 

state  given  the  right  subvactor  is  {1,0,1},  corresponding  to 

KS1G(4);  the  left  subvector  is  {2,1,1,2},  corresponding  to 

LSIG(4);  and  the  blocking  subvector  is  {1,0,0}, 

corresponding  to  a  binary  10  or  a  decimal  2.   Since  the 

length  of  LSIG  is  6  and  since  the  first  three  right 

subvectors  have,  respectively,  one,  two  and  one  nonzerc 

1     2     1 
digits,  NSTG4(4)  =6x(2   +2   +2)=  48. 


(i)  r  =  4;  s  =  4  ;  t  =  1  ;  k  =  2;  m  =  2. 

2 
(iii)  b  =  10;  i  =  2   =  4;  j  =  1  +  2  =  3. 

(iv)  The  state  number  is  48  +  4x(4-1)  +  3  =  63. 
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APPENDIX  C 


IRREEUCIEILITY  AND  PERIODICITY  IN  CENTRAL-SERVER  MODELS 


In  this  appendix  irreducibility  is  exhibited  for  a  large 
class  of  central-server  models.   The  technique  is  one  of 
indicating  a  string  of  feasible  transitions  from  an 
arbitrary  state  A  to  another  arbitrary  state  B.   In  this  way 
it  is  seen  that  the  associated  Markov  chain  is  irreducible. 
In  addition,  the  periodic  nature  of  many  central-server 
models  is  examined. 

Consider  a  central-server  model  (Figure  1.1)   consisting 

of  a  central  server  (queue  M  «■  1),  hereafter  referred  to  as 

the  CPU,  and  M  >  2  other  queues,  hereafter  referred  to  as 

PP1,  PP2,  ...,  FEM.   Suppose  that  there  are  I  job  types  and 

that  for  each  i  =  1,  2,  .  ..,  I,  there  are  N   >  0  jobs  cf 

i 

type  i  circulating  in  the  system.   Then  the  total  number  of 
jobs,  N,  circulating  in  the  system  is 

I 

n  =  y  n 

""   i 
i=1 

For  each  i  =  1,  2,  ...,  I  and  each  j  =  1,  2,  ...,  M,  let  p. 

be  the  branching  probability  of  type-i  jobs  to  PPj  after 
completion  of  service  at  the  CPU.   Since  the  network  is 
closed, 
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M 

I    P.    =   1 

3-1  13 


for  each  i  =  1,  2,    . ..,  I. 

Let  A  and  B  be  any  feasible  states  of  the  system, 
Denote  state  E  by 


B   -   (b  ,b  ,...,b  ,n  ,...,n  ) 
12       N   1       M 


where  (as  in  Chapter  II)  n   is  the  number  of  jobs  at  PEj  in 

J 

state  3f  and  the  subvector  B*  =  (b  ,...,b  )   is  a 

1       N 

permutation  cf  N   ones,  N   twos,  ...,  N   I's,  such  that  the 

1         2  I 

first  n   components  of  B'  list  the  job  types  cf  jobs  at  PP1, 

the  next  n   components  list  those  at  PP2,  and  so  forth.   The 
2 

last  n   =N-n   -  ...  -  n   components  of  B'  list  the  job 
c         1  M 

types  of  jots  at  the  CPU.   The  subvector  of  B'  corresponding 

to  each  queue  should  represent  the  job  types  of  jobs  at  that 
queue  in  an  crder  of  arrival  appropriate  to  the  state  E  and 
the  gueuing  discipline  at  the  queue.   For  example,  suppose 

n   =  3  and  PE1  contains  one  job  each  of  types  1,  2,  and  3. 

If  PP1  has  a  ES  or  IS  queuing  discipline,  (b  ,b  ,b  )  can  be 

any  permutation  of  (1,2,3).   However,  if  PP1  has  a  FCFS 

queuing  discipline  and  the  type-2  job  is  in  service,  the 
type-1  jot  next  and  the  type-3  job  last  in  state  3,  then 

(b  ,b  ,b  )  must  be  (2,1,3). 
1   2   3 
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It  is  easily  seen  that  the  representation  given  abcve 
may  not  be  unigue.   However,  unigueness  of  representation  is 
not  important  for  the  proofs  which  follow.   State  A  can  be 
similarly  represented.   But  this,  too,  is  not  important  to 
the  following  proofs. 

2!ii§2£^i9  fejl    Suppose  that  the  CPU  has  a  FCFS  gueuing 

discipline  and  a  single  server,  that  all  queues  have 

infinite  capacity  and  exponential  servers,  and  that  I  <  H 

and  the  gueues  can  be  numbered  so  that  p    #0  for  i  =  1,  2, 

ii 

...,  I.   (that  is,  jobs  of  type  i  are  allowed  to  route  to 

PPi.)   Then,  the  system  is  irreducible. 

£f22l:    Consider  arbitrary  states  A  and  B  as  described 
above,  and  start  in  state  A.   State  B  results  from  the 
following  seguence  of  feasible  transitions: 

1.  Cne  at  a  time  route  all  jobs  at  the  PP*s  to 
the  CPO. 

2.  One  at  a  time  route  all  jobs  from  the  CPU  to 
the  PP's  so  that,  for  each  i  =  1,  2,  ...,  I, 

all  type-i  jobs  route  to  PPi. 

3.  For  k  =  1,  route  a  type-b   job  from  PPb   to 

k  k 

the  CPO.   Repeat  with  k  =  2,  then  k  =  3,  and  so 

forth  to  k  =  N. 

4.  Now  route  the  first  n   jobs  to  PP1,  the  next 

1 

n   jobs  to  PP2,  and  so  forth,  leaving  the  last 
2 

n   jets  at  the  CPU. 
c 

Corollary  C-1-A    Theorem  C- 1  still  holds  if  the  CPU  has 
multiple  servers,  or  if  the  CPU  has  any  of  the  following 
gueuing  disciplines:   IS,  PS,  LCFS,  LCFSPE. 
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£^ooJ:    The  sequence  of  events  given  above  will  work  in 
each  of  these  cases  (with  obvious  changes  in  arrival  of  jobs 
at  the  CPD  in  step  3  for  LCFS  and  LCFSPR  queuing 
disciplines) . 

Corollary  C-1-E  Theorem  C- 1  still  holds  if  the  CEU  has 
a  class  priority  queuing  discipline  provided  the  PP's  can  be 
chosen  so  that  the  queuing  disciplines  at  PP1,  PP2,  ...,  PPI 
are  IS,  JCFS  cr  PS. 

Proof:    The  sequence  of  events  given  atcve  will  wcrk  if 

each  of  the  first  N  -  n   jobs  routed  to  the  CPU  in  step  3  is 

c 

immediately  routed  to  the  appropriate  PP.   Step  4  is  net 

needed. 

Corollary  Q1-C    Theorem  01  still  holds  if  the  various 
queues  have  generalized  Srlangian  service  distributions  and 
each  queue  has  either  a  FCFS,  LCFS,  IS  or  PS  queuing 
disciplice. 

££22i:    To  the  sequence  of  events  given  above  add: 

5.   Advance  each  job  in  service  to  its  appropriate 
stage  of  service. 

Note  that  many  ether  corollaries  are  possible.   Several 

of  these  should  be  obvious  to  the  reader.   For  example, 

generalized  Erlangian  service  distributions  are  possible 

with  LCFSPR,  LCFSPHpt  and  class  priority  queuing  disciplines 

provided  strict  attention  is  paid  to  the  order  of  arrival  of 

jobs  and  advancement  of  jobs  to  the  proper  stages  of 

service.   Also,  finite  capacities  are  possible  provided  PP1, 

...,  PPI  can  be  chosen  so  that  the  capacity  at  PPi  is  at 

least  as  large  as  N   +  n  .   Rather  than  consider  details  of 

i    i 
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further  corollaries  to  Theorem  c- 1 ,  consider  the  following 
theorem. 

Theorem  Z-2        Suppose  that  the  PP's  can  be  numbered  so 

that  p    #  0  for  i  =  1,  2,    .  ..,  I.   Further  suppose  that  for 
i1 

each  i  there  is  a  j  #  1  such  that  p    *  0,  and  that  each  of 

ij 

PP1  and  CPU  is  a  single  server  FCFS  queue  with  infinite 
capacity.   Then  the  system  is  irreducible. 

Proof :    Again,   start  in  state  A.   State  B  results  from 
the  following  sequence  of  feasible  transitions: 

1.  One  at  a  time  route  all  jobs  at  the  PP's  to 
the  CPU. 

2.  For  k  =  1,  route  the  job  in  service  at  the  CPO 

to  PE1  if  it  is  of  type  b  .   Route  it  to  PPj 

k 

fcr  seme  j  #  1  and  then  back  to  the  CPU  if  it 

is  net  of  type  b  .   Repeat  this  procedure  until 

k 

EE1  contains  exactly  k  jobs. 

3.  Repeat  step  2  for  k  =  2,    then  for  k  =  3,  and 
sc  forth  up  to  k  =  N. 

4.  One  at  a  time  route  all  jobs  from  PP1  to  the 
CEU. 

5.  Route  the  first  n   jobs  to  PP1,  the  next  n 

1  2 

jobs  to  PP2,  and  so  forth,  leaving  the  last  n 

c 

jefcs  at  the  CPU. 

As  with  Iheorem  C-1,  many  corollaries  can  be  stated  and 
proved.   Rather  than  doing  sc,  simply  note  that 
irreducibility  of  the  three  models  discussed  in  Chapter  IV 
as  having  been  programmed  is  assured  by  these  theorems  and 
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their  ccrcllaries  provided  there  are  at  least  two  PP's  to 
which  jobs  can  route. 

IJi§2I£iS  Qz3        If  all  service  distributions  are 
exponential,  each  state  of  the  resulting  Markov  chain  which 
can  be  reentered  is  periodic  with  an  even  period. 

E!L22£Z        Consider  a  sequence  of  states  resulting  from  a 

feasible  sequence  of  transitions,  beginning  and  ending  with 

the  same  state:   fl=A,A,A,  ...,  A    ,  A   =  B.   It  will 

0    12        k-1    k 

suffice  to  shew  that  k  must  be  even. 

Define  a  function  f  from  the  state  space  into 

the  set  cf  nennegative  integers  by: 


f  (A)   =   n   +  n 

c    b 


where  n   is  the  number  of  jobs  at  the  CPU  in  state  A  and  n 
c  b 

is  the  nun.ber  of  blocked  servers  throughout  the  network  in 

state  A. 

A  transition  from  A   to  A    can  only  take 

J      j+1 

place  when  a  jcb  completes  service  at  some  gueue.   If  the 

job  cannct  meve,  the  server  which  was  servicing  it  becomes 

blocked:  i.  €.,  n   is  increased  by  one  and  n   remains  the 

b  c 

same. 

If  the  job  can  move,  and  if  no  server  was 

blocked  because  of  the  finite  capacity  at  the  queue  at  which 

service  was  just  completed,  then  only  the  one  job  will  move. 

Because  of  the  structure  of  the  central-server  model,  the 

result  will  be  either  an  increase  or  a  decrease  in  n   by 

c 

one.      The    value   of    n      will   remain    unchanged. 

b 
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If,  on  the  other  hand,  the  movement  of  this 
job  "unblocks"  a  server  at  another  queue,  two  or  more  jobs 
will  move  simultaneously.   Because  of  the  structure  of  the 
central-server  model,  each  of  these  jobs  will  move  either 
toward  the  CFU  or  away  from  the  CPU,  resulting  in  a  change 

of  one  in  the  value  of  n   for  each  job  which  moves.   (Since 

c 

seme  of  these  changes  are  positive  and  others  are  negative, 

the  net  chance  in  the  value  of  n   is  no  more  than  one  in 

c 

absolute  value.)   For  each  job  which  moves,  except  the  job 

which  completes  service  to  initiate  the  transition,  a  server 

is  unblocked,  decreasing  n   by  one.   Thus,  the  change  in  n 

b  c 

+  n   is  one  fcr  the  job  which  completes  service  and  either 
b 

minus  two  or  zero  for  each  of  the  other  jobs.   The  net 

change  is  therefore  odd. 

Since  the  net  change  in  n   +  n   is  odd  nc 

c     b 

matter  what  the  circumstances  of  the  transition,  f (A    ) 

must  be  even  if  f(A  )  is  odd,  or  odd  if  f  (A  )  is  even. 

J  J 

Considering  the  sequence  of  transitions  given  above,  if  f (3) 

is  even,  f (A  )  is  even  for  even  j  and  odd  for  odd  j.   If 

J 
f(E)  is  odd,  f (A  )  is  odd  for  even  j  and  even  for  odd  j. 

j 

Since  A   =  B,  f  (A  )  =  f(B).   Hence,  Jc  must  be  even, 
k         k 


Since  the  author  is  interested  in  steady-state 
solutions,  the  principle  concern  is  systems  which  contain  no 
absorbing  states.   Thus,  the  following  conclusion  to  Theorem 
C-3  would  be  more  to  the  point:     "the  resulting  idarkcv 
chain,  if  it  contains  no  absorbing  states,  is  periodic  with 
even  period."   In  some  cases,  as  is  seen  in  the  following 
theorem,  more  specific  results  can  be  derived. 


328 


Theorem  C-4    If  all  service  distributions  are 
exponential  and  there  is  a  feasible  state  B  such  that  the 
CPU  is  idle  and  some  PP  has  only  one  type  of  job,  then  state 
B  has  period  two. 

£j.2oJ*    Suppose  that  PPj  contains  only  jobs  of  type  i. 
Then,  if  a  jcb  completes  service  at  PPj,  the  system 
transitions  tc  a  state  A  identical  to  B  except  that  there  is 
a  job  of  typ€  i  at  the  CPU  and  one  less  job  at  PPj.   From  A 
the  system  may  transition  back  into  B  if  the  job  at  CPU 
completes  service  and  routes  to  PPj.   (Note  that  the 
branching  probability  of  type-i  jobs  to  PPj  cannot  be  zero 
since  state  E  is  feasible.)   Thus,  the  period  of  B  is  do 
larger  than  two.   Ey  Theorem  C-3,  the  period  of  B  is  even. 
Thus,  this  period  is  two. 

The  importance  of  Theorem  C-4  is  quickly  seen  for  acdels 
which  alsc  satisfy  the  hypotheses  of  Theorem  C-1,  Theorem 
C-2  or  one  of  their  corollaries.   The  result  is  a  Markov 
chain  of  period  two  since  irreducibility  is  also  provided. 

Generalizations  of  Theorems  C-3  and  C-4  are  possible  in 
certain  mcdels  with  nonexponential  service  distributions. 
For  example,  if  the  CPU  has  two-stage  Erlang  service 
distributions,  and  if  all  PP's  have  exponential  service 
distributions,  the  period  is  divisible  by  three.   On  the 
other  hand,  aperiodic  states  result  from  many  generalized 
Erlangian  service  distributions  (since  the  job  may  complete 
service  at  any  stage)  or  from  Erlang  service  distributions 
with  relatively  prime  numbers  of  stages  at,  say,  different 
PP's. 
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APPENDIX  G 


GLOSSARY  CF  tfAJOR  VARIABLE  NAdES  IN  TH2  THREE  PHOGBAMS 


ALP  A    A  matrix  used  to  store  the  branching  probabilities. 
AIFA(i,j)  is  the  probability  that  a  type-i  job  Mill 
route  to  PPj  when  it  completes  service  at  the  CPU. 
ALFA  is  input. 

CGEF    A  vector  used  to  store  the  positive  values  of  the 
coefficient  matrix.   COEF  (i)  is  the  coefficient 
corresponding  to  state  INDEX(i)  in  the  j-th  balance 
equation  if  NC0N(j-1)  +  1  <  i  <  tfCON  (j)  .   The  values 
are  given  to  COEF,  L'ADZl    and  NCON  in  the  main 
routine.   After  the  steady-state  distribution  has 
been  determined,  some  of  the  locations  in  COE?  are 
used  in  subroutine  ACCUM  to  accumulate  the  measures 
of  system  effectiveness. 

CPU     NPE  +  1;  the  total  number  of  processors.   Processor 
CPU  is  the  central  server. 

IA3     An  error-indication  variable.   Its  value  is  checked 
tc  aocrt  the  problem  if  an  unfixabie  error  is 
detected  in  the  input  or  dimensions. 

INDEX   A  vector  used  to  store  the  state  number  of  the  terms 
in  the  balance  equations.   See  COEF. 

IFEAL   An  optional  print  indicator.   Setting  IP3AL  =  1  will 
cause  the  balance  equations  to  be  printed. 


397 


IP1STG  A  dimension  indicator.   IPLSTG  should  be  set  equal  to 
the  dinension  of  LSTG.   As  LSTG  is  being  filled  in, 
IPLSTG  is  tested  to  see  that  storage  does  not  take 
place  beyond  the  dimension  of  LSTG. 

IPPB03  An  optional  print  indicator.   Setting  IPPROB  =  1  Hill 
cause  the  steady-state  probabilities  to  be  printed. 
Only  the  first  1000  will  be  printed  if  NSTATE  >  1000. 

IPSTAT  An  optional  print  indicator.   Setting  IP5TAT  =  1  will 
cause  the  vector  representation  and  state  number  for 
each  state  to  be  printed. 

IP1     A  dimension  indicator  for  NSTG1  and  NSTG2.   Performs 
for  these  vectors  what  IPLSTG  does  for  LSTG. 

IP3     A  dimension  indicator  for  NSTG3.   See  IPLSTG. 

IPU     A  dimension  indicator  for  NSTG4  and  KSTG.   See 
ItLSTG. 

ISTATE  Vector  representation  for  the  left-hand  side  of  the 
balance  equations.   As  the  i-th  oalance  equation  is 
being  generated,  ISTATE  contains  the  ISTATE  vector 
representation  of  the  i-th  state. 

IIE8    A  running  variable  which  indicates  the  current 
problem  number.   See  NPHOB. 

JSTATE  Vector  representation  for  the  right-hand  side  of  the 
balance  equations.   As  the  i-th  balance  equation  is 
being  generated,  JSTATE  contains,  in  succession,  the 
ISTATE  vector  representation  of  state  INDEX  (j),  when 
CCEF  ( j)  is  being  filled  in,  for  each  j  from  SC0N(i-1) 
+  1  tc  NCON  (i)  . 
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KS1G    Vector  used  for  storage  of  the  right  subvectors. 

LS1G    Vector  used  for  storage  of  the  left  subvectors. 

MP      Number  of  servers  at  the  CP0  (tapes  model  only) .  ?l? 
is  input. 


N 


N1  +  &2;  the  total  number  of  jobs  in  the  system. 


NCCN    A  fcinter  vector.   NCOS  (i)  points  to  the  end  of  tho 
i-th  talance  equation  in  COEF  and  INDEX.   See  CC2F. 

NCPU    The  number  of  jobs  at  the  CPU. 

NDIG    N  -  NCPU;  the  number  of  jobs  at  the  PP's. 

NOCQEF  A  dimension  indicator  for  COEF  and  INDEX.   See 
IPL51G. 

NOP     A  dimension  indicator  for  P  and  NCON.   See  IPL5IG. 

NPE     The  number  of  PP's.   NPP  is  input. 

NPflGfl   Toe  number  cf  problems  to  oe  attempted  at  the  current 
run.   NPCOB  is  input. 

NP1     N  +  1.   See  N. 

N?2     N  +  2.   See  N. 

NSTAT2  Nuuber  of  states.   After  the  states  have  be.^n 

generated  the  first  time,  NSTATE  is  the  total  number 
or  states.  Prior  to  that  time,  NSTATE  is  tae  number 
of  states  generated  up  to  that  time. 
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NSIG1   A  pointer  vector.   NSTG1(i)  is  the  location  in  KSTG 
of  the  first  right  subvector  whose  elements  sura  to  i< 

NS1G2   A  pointer  vector.   Ose  varies  with  program. 

NS1G3   A  pointer  vector.   Use  varies  with  program. 

NSTG4   A  pointer  vector.   Use  varies  with  program. 

NTCT    N  +  NEF;  the  length  of  the  I5TATE  vector 
representation  of  a  state. 

NZCOEP  Number  of  nonzero  elements  in  the  coefficient  matrix, 
NZCOEP  is  the  length  of  COEF. 


N1 


Number  cf  type-one  jobs.   N1  is  input. 


N2 


Number  of  type-two  jobs.   N2  is  input. 


P       A  vector  used  to  store  the  state  probabilities. 

RATE         A  matrix  used  to  store  the  service  rates.   BATE  (i,j) 

is  the  rate  for  type-i  jobs  at  processor  j.   3AIE  is 
input . 
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